J 



Europaisches Patentamt 

European Patent Office ^ 

. . © Publication number: 0 200 421 B1 

Office europeen des brevets ^ 



© EUROPEAN PATENT SPECIFICATION 

© Date of publication of patent specification: 2a07.93 © Int. CL 5 :C12N 15/57, C12N 5/10, 

C12N 9/64, A61K 37/54, 
© Application number: 86302855.1 ^61 K 37/547 

© Date of filing: 16.04.86. 



© Expression of factor VII activity in mammalian cells. 



® Priority: 17.04.85 US 724311 


Seattle, WA 981 05 (US) 


16.12.85 US 810002 




© Date of publication of application: 


@ Inventor: Hagen, Frederick S. 


3835 - 44th N.E. 


10.12.86 Bulletin 86/45 


Seattle Washington 98105(US) 


© Publication of the grant of the patent: 


Inventor: Murray, Mark J. 


2211 -11th Avenue East 


28.07.93 Bulletin 93/30 


Seattle Washington 98102(US) 


© Designated Contracting States: 


Inventor: Busby, Sharon J. 


4109 Meridian N. 


AT BE CH DE FR GB IT LI LU NL SE 


Seattle Washington 98103(US) 


© References cited: 


Inventor: Berkner, Kathleen L 


3032 - 22nd Avenue W. 


WO-A-83/00016 


Seattle Washington 98199(US) 




Inventor: Insley, Margaret Y. 


PROCEEDINGS OF THE NATIONAL ACADEMY 


16860 N.E. 150th SL 


OF SCIENCES OF THE USA, vol. 83, no. 8, 


Woodlnvllle Washington 98072(US) 


April 1986, pages 2412-2416, Washington, US; 


Inventor: Woodbury, Richard G. 


F.S. HAGEN et a I.: "Characterization of a 


15464 -10th Avenue, N.E. 


cDNA coding for human factor VII' 1 


Seattle Washington 98155(US) 




Inventor: Gray, Charles L 


Trombosls Res., vol. 22 (1981), pages 


8014 -41 st Avenue, N.E. 


375-380 


Seattle Washington 98115(US) 


© Proprietor: ZYMOGENETICS, INC. 




4225 Roosevelt Way, N.E. 





CD 
O 

o 

CM 



CL 
LU 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person 
may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition 
shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee 
has been paid (Art. 99(1) European patent convention). 



Rank Xerox (UK) Business Services 

13.10/3.6/3.3.11 



EP 0 200 421 B1 



© Representative: Brown, John David et al 
FORRESTER & BOEHMERT Franz- 
Joseph-Strasse 38 
W-8000 Munchen 40 (DE) 



2 



EP 0 200 421 B1 



Description 

Technical Field 

5 The present invention relates to blood coagulation factors in general, and more specifically, to the 
expression of proteins having biological activity for blood coagulation. 

Background Art 

w Blood coagulation is a process consisting of a complex interaction of various blood components or 
factors which eventually gives rise to a fibrin clot. Generally, the blood components which participate in 
what has been referred to as the coagulation "cascade" are proenzymes or zymogens, enzymatically 
inactive proteins which are converted to proteolytic enzymes by the action of an activator, itself an activated 
clotting factor. Coagulation factors which have undergone such a conversion are generally referred to as 

75 "activated factors," and are designated by the addition of a lower case postscript "a" (e.g., Vila). 

There are two separate systems which can promote blood clotting and thereby participate in normal 
haemostasis. These systems have been referred to as the intrinsic and the extrinsic coagulation pathways. 
The intrinsic pathway refers to those reactions which lead to thrombin formation through utilization of factors 
present only in plasma. An intermediate event in the intrinsic pathway is the activation of Factor IX to Factor 

20 IXa, a reaction catalyzed by Factor Xla and calcium ions. Factor IXa then participates in the activation of 
Factor X in the presence of Factor Villa, phospholipid and calcium ions. The extrinsic pathway involves 
plasma factors as well as components present in tissue extracts. Factor VII, one of the proenzymes referred 
to above, participates in the extrinsic pathway of blood coagulation by converting (upon its activation to Vila) 
Factor X to Xa in the presence of tissue factor and calcium ions. Factor Xa in turn then converts 

25 prothrombin to thrombin in the presence of Factor Va, calcium ions and phospholipid. Because the 
activation of Factor X to Factor Xa is an event shared by both the intrinsic and extrinsic pathways, Factor 
Vila can be used for the treatment of patients with deficiencies or inhibitors of Factor VIII (Thomas, U. S. 
Patent 4,382,083). There is also some evidence to suggest that Factor Vila may participate in the intrinsic 
pathway as well (Zur and Nemerson, J. Biol. Chem. 253: 2203-2209, 1978) by playing a role in the 

30 activation of Factor IX. ~~ 

Experimental analysis has revealed that human Factor VII is a single-chain glycoprotein with a 
molecular weight of approximately 50,000 daltons. In this form, the factor circulates in the blood as an 
inactive zymogen. Activation of Factor VII to Vila may be catalyzed by several different plasma proteases, 
such as Factor Xlla. Activation of Factor VII results in the formation of two polypeptide chains, a heavy 

35 chain (M r = 28,000) and a light chain (M r = 17,000), held together by at least one disulfide bond. Factor 
VII may also be activated to Vila in vitro, for example, by the method disclosed by Thomas in U.S. Patent 
No. 4,456,591. 

Factor IX circulates in the blood as a single-chain precursor of molecular weight 57,000 and is 
converted to an active serine protease (Factor IXa) upon cleavage by Factor Xla. Factor IXa consists of a 

40 light chain and a heavy chain of molecular weights 16,000 and 29,000, respectively. 

Current treatment practices for patients having coagulation disorders (e.g., deficiencies of Factor VIII 
and IX) generally involve replacement therapy with cryoprecipitate or other fractions of human plasma 
containing enriched levels of a particular factor. These preparations have heretofore been obtained from 
pooled human plasma, although the preparation of cryoprecipitates requires the use of a relatively large 

45 amount of human plasma as starting material. 

Therapeutic uses of Factor VII exist in the treatment of individuals exhibiting a deficiency in Factor VII, 
as well as Factor VIII and Factor IX deficient populations, and individuals with Von Willebrand's disease. 
More specifically, individuals receiving Factors VIII and IX in replacement therapy frequently develop 
antibodies to these proteins. Continuing treatment is exceedingly difficult because of the presence of these 

so antibodies. Patients experiencing this problem are normally treated with an activated prothrombin complex 
known to consist of a mixture of active and inactive clotting enzymes, including Factor Vila. Further, recent 
studies indicate that small amounts (40-50 micrograms) of injected Factor Vila are effective in controlling 
serious on-going bleeding episodes in Factor VIII deficient patients who have high levels of antibody in their 
blood (Hedner and Kisiel, J. Clin. Invest. 71: 1836-1841, 1983). 

55 Due to the diverse sources of the plasma used in the preparation of cryoprecipitates, it is difficult to test 
the preparations to ensure that they are free of viral contamination. For instance, essentially all recipients of 
cryoprecipitate show a positive test for hepatitis. Recent reports have also indicated that some hemophiliacs 
receiving cryoprecipitate have developed acquired immune deficiency syndrome (AIDS). In addition, the 
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purification of large amounts of these factors is extremely difficult and expensive. 

Consequently, there exists a need in the art for a method of producing relatively large quantities of pure 
preparations of Factor Vila and Factor IX. The present invention fulfills this need through the use of 
recombinant DNA technology, successfully eliminating the problem of viral contamination and, at the same 
5 time, providing a consistent and homogenous source of active Factor Vila to treat Factor VII and Factor IX 
deficient patients and individuals with Von Willebrand's disease. 

Disclosure of the Invention 

70 Briefly stated, the present invention discloses a DNA construct containing a nucleotide sequence 
encoding human factor VII having an amino acid sequence as shown in Figure 1b. The nucleotide sequence 
may comprise a first nucleotide sequence derived from a cDNA or a genomic clone of Factor VII joined to a 
second nucleotide sequence positioned downstream of the first sequence, said second nucleotide sequence 
derived from a cDNA clone of of Factor VII. The joined sequences code for a protein which upon activiation 

75 has Factor VII a biological activity for blood coagulation. Further, the first nucleotide sequence may also 
encode a leader peptide and may also include a double-stranded oligonucleotide. 

In addition, the present invention discloses recombinant plasmids capable of integration in mammalian 
host cell DNA. One of the plasmids includes a promoter followed downstream by a set of RNA splice sites, 
the RNA splice sites being followed downstream by a nucleotide sequence which codes for Factor VII. The 

20 sequence codes for a protein which upon activation has Factor Vila biological activity for blood coagulation. 
The nucleotide sequence is then followed downstream by a polyadenylation signal. 

Similar to the recombinant plasmid noted above, the present invention also discloses a second plasmid 
which includes a promoter followed downstream by a set of RNA splice sites, the RNA splice sites being 
followed downstream by a nucleotide sequence which codes at least partially for Factor IX. The nucleotide 

25 sequence comprises a first nucleotide sequence which encodes a calcium binding domain joined to a 
second nucleotide sequence positioned downstream of the first sequence. The second nucleotide sequence 
encodes a catalytic domain for the serine protease activity of Factor IX. The joined sequences code for a 
protein having substantially the same biological activity for blood coagulation as Factor IX. The nucleotide 
sequence is then followed downstream by a polyadenylation signal. 

30 A third aspect of the invention discloses mammalian cells stably transfected to produce a protein having 
susbtantially the same biological activity, upon activation, as Factor Vila. The ceils are transfected with a 
DNA construct containing a nucleotide sequence which codes for Factor VII. The sequence codes for a 
protein which, upon activation, has Factor Vila biological activity for blood coagulation. 

The present invention further provides for a method of producing a protein having biological activity for 

35 blood coagulation mediated by Factor Vila through establishing a mammalian host cell which contains a 
DNA construct containing a nucleotide sequence which codes for Factor VII. The sequence codes for a 
protein which, upon activation, has Factor Vila biological activity for blood coagulation. Subsequently, the 
mammalian host is grown in an appropriate medium containing vitamin K and the protein product encoded 
by the DNA construct and produced by the mammalian host cell is isolated. The protein product is then 

40 activated to generate Factor Vila. 

Yet another aspect of the present invention discloses a DNA construct comprising a DNA sequence 
encoding Factor VII having an amino acid sequence as shown in Figure 1b. In a preferred embodiment, the 
DNA sequence comprises the cDNA sequence of Figure 1b from bp 36 to bp 1433. In another preferred 
embodiment, the DNA sequence comprises the cDNA sequence of Figure 1b from bp 36 to bp 99, followed 

45 downstream by the sequence from bp 166 to bp 1433. Recombinant plasmids capable of integration in 
mammalian host cell DNA comprising the DNA sequences described immediately above are also disclosed. 

Mammalian cells stably transfected with a recombinant plasmid comprising a DNA sequence encoding 
Factor VII having an amino acid sequence as shown in Figure 1b are also disclosed. In preferred 
embodiments, the DNA sequence comprises the cDNA sequence of Figure 1b from bp 36 to bp 1433, or 

so the cDNA sequence of Figure 1b, from bp 36 to bp 99, followed downstream by the sequence from bp 166 
to bp 1433. 

A method for producing a protein having biological activity for blood coagulation mediated by Factor 
Vila through establishing a mammalian host cell that contains a DNA construct as described above is also 
disclosed. The mammalian host cell is subsequently grown in an appropriate medium containing vitamin K, 
55 and the protein product encoded by the DNA construct is isolated. The protein product is then activated to 
generate Factor Vila. 

Other aspects of the invention will become evident upon reference to the following detailed description 
and attached drawings. 
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Brief Description of the Drawings 

Figure 1a illustrates the partial Factor VII cDNA sequence produced by joining portions of cDNA clones 
\VII2115and X VIM 923. 

5 Figure 1b illustrates the Factor VII cDNA sequence of XVII2463. Arrows indicate the extent of the 
deletion in the sequence of XVII565. Numbers above the sequence designate amino acids. Numbers below 
designate nucleotides. 

Figure 2a illustrates the amino acid sequences of the amino terminal regions of several clotting factors. 
Figure 2b illustrates a comparison of the amino acid sequence of Factor VII obtained from protein 
70 sequencing with that encoded by the cDNA. 

Figure 3 illustrates the joining of Factor IX leader sequences to a sequence encoding a consensus 
calcium binding domain. 

Figure 4 illustrates the joining of the Factor IX-consensus sequence hybrids to a partial Factor VII cDNA 
to produce an in-frame coding sequence. 
75 Figure 5 illustrates the construction of a plasmid containing a coding sequence for a Factor IX/Factor VII 
fusion protein. 

Figure 6 illustrates the expression vector FIX7VII/pD2. Symbols used are Ad2 MLP, the major late 
promoter from adenovirus 2; L1-3, the adenovirus 2 tripartite leader sequence; 5'ss, 5' splice site 1 3'ss, 3' 
splice site' and pA, the late polyadenylation signal from SV40. 
20 Figure 7 illustrates the nucleotide sequence of a Factor IX/Factor VII cDNA fusion. 

Figure 8 illustrates expression vector pM7135. Symbols used are E, the SV40 enhancer; ori, the 0-1 
map units Ad 5; pA, the early polyadenylation signal from SV40; A, the deletion region of the pBR322 
"poison" sequences; and other symbols as described for Figure 6. 

Figure 9 illustrates the subcloning of the 2463bp Factor VII cDNA. 
25 Figure 10 illustrates the subcloning of the 565bp Factor VII cDNA. 

Figure 11 illustrates the joining of the 5' end of pVII565 and the 3 f portion of pVII2463in pUC18 to 
generate pVII2397. 

Figure 12 illustrates the construction of the expression plasmids FVII(2463)/pDX and FVII(565 + 2463)- 
/pDX. pA denotes the polyadenylation signal from SV40 in early or late orientation, as described in Example 
30 9. Other symbols are as described for Figure 8. 

Best Mode for Carrying Out the Invention 

Prior to setting forth the invention, it may be helpful to an understanding thereof to set forth definitions 
35 of certain terms to be used hereinafter. 

Complementary DNA or cDNA: A DNA molecule or sequence which has been enzymatically syn- 
thesized from the sequences present in a mRNA template. 

DNA Construct: A DNA molecule, or a clone of such a molecule, either single- or double-stranded, 
which may be isolated in partial form from a naturally occurring gene or which has been modified to contain 
40 segments of DNA which are combined and juxtaposed in a manner which would not otherwise exist in 
nature. 

Plasmid or Vector: A DNA construct containing genetic information which may provide for its replication 
when inserted into a host cell. A plasmid generally contains at least one gene sequence to be expressed in 
the host cell, as well as sequences which facilitate such gene expression, including promoters and 
45 transcription initiation sites. It may be a linear or closed circular molecule. 

Joined : DNA sequences are said to be joined when the 5' and 3' ends of one sequence are attached, 
by phosphodiester bonds, to the 3* and 5* ends, respectively, of an adjacent sequence. Joining may be 
achieved by such methods as ligation of blunt or cohesive termini, by synthesis of joined sequences 
through cDNA cloning, or by removal of intervening sequences through a process of directed mutagenesis. 
50 Leader Peptide : An amino acid sequence which occurs at the amino terminus of some proteins and is 
generally cleaved from the protein during subsequent processing and secretion. Leader peptides comprise 
sequences directing the protein into the secretion pathway of the cell. As used herein, the term "leader 
peptide" may also mean a portion of the naturally occurring leader peptide. 

Domain : A three-dimensional, self-assembling array of specific amino acids in a protein molecule which 
55 contains all or part of the structural elements necessary for some biological activity of that protein. 

Biological Activity : A function or set of functions performed by a molecule in a biological context (i.e., in 
an organism or an in vitro facsimile). Biological activities of proteins may be divided into catalytic and 
effector activities. Catalytic activities of clotting factors generally involve the activation of other factors 
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through the specific cleavage of precursors. Effector activities include specific binding of the biologically 
active molecule to calcium or other small molecules, to macromolecules such as proteins, or to cells. 
Effector activity frequently augments, or is essential to, catalytic activity under physiological conditions. 
Catalytic and effector activities may, in some cases, reside within the same domain of a protein. 

5 For Factor Vila, biological activity is characterized by the mediation of blood coagulation through the 
extrinsic pathway. Factor Vila activates Factor X to Factor Xa, which in turn converts prothrombin to 
thrombin, thereby initiating the formation of a fibrin clot. Because the activation of Factor X is common to 
both the extrinsic and intrinsic pathways of blood coagulation, Factor Vila may be used to treat individuals 
severely deficient in the activities of Factor IX, Factor VIII or Von Willebrand Factor. 

w As noted above, the isolation of Factor VII from human plasma is a time-consuming and expensive 
process since the factor is a rare protein present only at a concentration of approximately 300 micrograms 
per liter of blood. In addition, it is difficult to separate from prothrombin, Factor IX and Factor X and is 
susceptible to proteolytic attack during purification (Kisiel and McMullen, ibid). Although single-chain human 
Factor VII has been purified to homogeneity (Kisiel and McMullen, ibid), the published purification methods 

75 are generally limited by low yield and/or contamination by other coagulation factors. 

Factor VII is produced in the liver and requires vitamin K for its biosynthesis. Vitamin K is necessary for 
the formation of specific gam ma-carboxy glutamic acid residues in the factors. These unusual amino acid 
residues, which are formed by a post-translational modification, bind to calcium ions and are responsible for 
the interaction of the protein with phospholipid vesicles. In addition, Factor VII contains one 0-hydroxyaspar- 

20 tic acid residue which is also formed after the protein has been translated. However, the role of this amino 
acid residue is not known. 

Given the fact that the activity of Factor VII is dependent upon post-translational modifications involving 
the gamma carboxylation of specific glutamic acid residues, and may also be dependent upon the 
hydroxylation of a specific aspartic acid residue, it is unlikely that an active product could be produced 

25 through the cloning and expression of Factor VII in a microorganism. 

Accordingly, the present invention provides a method of producing a protein having biological activity 
for blood coagulation mediated by Factor Vila using stably transfected mammalian cells. 

As noted above, Factor VII requires vitamin K for its biosynthesis. In addition, the plasma proteins 
prothrombin, Factor IX, Factor X, Protein C, and Protein S also require vitamin K for their biosynthesis. The 

30 amino-terminal portions of these proteins, which contain gamma-carboxyglutamic acid residues, are ho- 
mologous in both amino acid sequence and in biological function (Figure 2a). Further, the carboxy-terminal 
portions of Factor VII, prothrombin, Factor IX, Factor X, and Protein C determine their specific serine 
protease functions. 

Factor VII is a trace plasma protein, and the mRNA encoding Factor VII is believed to be rare. 

35 Consequently, purification of Factor VII from plasma in sufficient quantities to permit extensive sequence 
analysis and characterization remains difficult. Degradation of Factor VII during purification, even in the 
presence of protease inhibitors, was noted by Kisiel and McMullen (ibid). Due to these difficulties, Factor VII 
has been poorly characterized, compared to other more abundant components of the blood coagulation 
system. Indeed, the work of Kisiel and McMullen (ibid) yielded sequence information for only 10 residues of 

40 each chain of Factor VII, and in each sequence the identification of two residues was tentative. Partial amino 
acid sequence data for Bovine Factor VII have also been published (DiScipio et al., ibid). 

The presumed rarity of Factor VII mRNA has contributed to the lack of knowledge of the Factor VII 
gene. The success of conventional cDNA cloning techniques is dependent on a sufficient quantity of mRNA 
for use as a template. Premature termination of reverse transcription results in the production of cDNA 

45 clones lacking the 5' end and this condition is exacerbated by low mRNA levels. Several strategies for 
cDNA cloning of low abundance message have been developed (Maniatis et al., Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor Laboratory, 1982), but a lack of knowledge of the amino acid 
sequence of the product of interest makes it impossible to predict the DNA sequence and to design 
appropriate oligonucleotide probes. While it may be relatively straightfoward to obtain a partial cDNA clone 

50 of a gene encoding a rare protein by using these advanced strategies, full-length cDNA clones of genes 
encoding rare proteins such as Factor VII remain exceedingly difficult to obtain. 

In comparison to Factor VII, Factor IX is a relatively abundant protein and the sequence of a cDNA 
clone of the human Factor IX gene is known (Kurachi and Davie, Proc. Natl. Acad. Sci. USA 79: 6461-6464, 
1982; and Anson et al., EMBO J. 3: 1053-1060, 1984). The structure of the Factor IX gene has been 

55 characterized and the amino acid sequence of the protein has been determined on the basis of the known 
nucleotide sequence. Some protein sequence data have also been published for human and bovine Factor 
IX and the sequences analyzed (DiScipio et al., ibid). The amino terminal portion of the protein contains 12 
glutamic acid residues that are converted to 7-carboxyglutamic acid (Gla) residues in the mature protein. 
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The cleavage sites involved in the activation of Factor IX have also been identified (Kurachi and Davie, ibid). 
A sequence at the 5' end of the Factor IX cDNA clone codes for a signal peptide which is typical of those 
found in most secreted proteins (Kurachi and Davie, ibid). The expression of the Factor IX gene through 
recombinant DNA methods has not been previously reported. 

s Because of the difficulty in obtaining a full-length cDNA clone of the Factor VII gene, three novel 
approaches were adopted to supply the 5' end of the coding sequence, including the region encoding the 
leader peptide. According to the first method, a partial cDNA clone for Factor VII is joined to a fragment 
encoding the leader peptide and 5' portion of Factor IX. This approach is based on the observation that the 
amino-terminal portions of the two molecules are responsible for the calcium binding activities of the 

io respective proteins and the discovery that the calcium binding activity of Factor IX can substitute for that of 
Factor VII. The resultant polypeptide retains the biological activity of authentic Factor VII because the 
specific serine protease activities of the coagulation factors reside in the carboxy-terminal regions of the 
molecules. The second approach combines the partial cDNA clone with a DNA sequence encoding the 
leader and amino-terminal regions of Factor VII. The. partial cDNA and amino acid sequences of Factor VII 

75 disclosed herein enable the screening of a genomic DNA library or cDNA library for clones comprising the 
5' portion of the Factor VII gene. The third approach involves joining the partial cDNA clone to hybrid 
coding sequences comprising a cDNA fragment encoding the leader peptide of Factor IX and a synthetic 
gene segment encoding a consensus calcium binding domain or a predicted amino terminal sequence for 
Factor VII. The coding sequence for the amino terminus of Factor VII was established through previously 

20 unpublished amino acid sequence data disclosed herein. The consensus sequence was derived from the 
factor VII data and published sequence data for other vitamin K-dependent plasma proteins. 

Consistent with the approach described above for screening for clones comprising the 5* portion of the 
Factor VII gene, the inventors have been successful in obtaining a full-length, correct cDNA that is suitable 
for expression. 

25 Among the cDNA clones that were generated, a clone designated "XVI 12463" contained the largest 
Factor VII cDNA insert. It was found to contain the entire coding sequence for Factor VII. This clone 
included a 35 nucleotide 5' untranslated region, 180 nucleotides coding for a 60 amino acid leader, 1218 
nucleotides coding for the 406 amino acid mature protein, a stop codon, 1026 nucleotides of 3' untranslated 
sequence, and a 20 base poly(A) tail (beginning at position 2463). This cDNA has now been sequenced in 

30 its entirety on both strands. A comparison of it with two cDNA inserts isolated earlier from clones XVII21 1 5 
and XVII1 923, revealed that XVII2463 contains, on a single EcoRI fragment, a Factor VII cDNA coding for 
Factor VII leader and mature protein sequences. 

A second clone, XVII565, was isolated that contained a cDNA insert that was identical to the cDNA of 
clone XVII2463 from nucleotide 9 to nucleotide 638, except that it lacked nucleotides 100 to 165 (Figure 

35 1b). In comparing the cDNAs to Factor VII genomic DNA, the absent sequences correspond precisely to 
one exon-like region. Therefore, two Factor VII cDNAs have been obtained which appear to reflect 
alternative mRNA splicing events. 

The leader encoded by XVII2463 is exceptionally long (60 amino acids) and has a very different 
hydrophobicity profile when compared with Factor IX, protein C and prothrombin. This leader contains two 

40 mets, at positions -60 and -26. Initiation most likely beigns at the first met, since a hydrophobic region, 
typical of signal peptides, follows the met at position -60, but not the met at -26. It is interesting that the 
absent sequence in XVII565, which corresponds precisely to an exon-like region in the genomic clone, 
results in a 38 amino acid leader with a hydrophobicity pattern more analogous to Factor IX, protein C, and 
prothrombin. 

45 Since it was not clear then which, if either, of the leaders described above was authentic, an additional 
approach was initiated in an effort to analyze the 5* end sequence. Briefly, this approach included the 
construction and screening of a human genomic DNA library, and the identification of genomic clones 
comprising Factor VII gene sequences. The 5' portion of the genomic sequence was subseqently joined to 
the cDNA to construct a full-length clone. 

so In an additional construct, a 5' Factor VII cDNA fragment of XVII565 containing ail of the leader and 29 
amino acids of the mature coding sequence was ligated to a fragment of the cDNA of XVII2463 (containing 
the remainder of the mature protein and 3'-untranslated sequences). This "565-2463" sequence encodes a 
full-length Factor VII cDNA sequence as a single EcoRI fragment. 

The DNA sequences described above are then inserted into a suitable expression vector which is in 

55 turn used to transfect a mammalian cell line. Expression vectors for use in carrying out the present 
invention will comprise a promoter capable of directing the transcription of a foreign gene in a transfected 
mammalian cell. Viral promoters are preferred due to their efficiency in directing transcription. A particulary 
preferred such promoter is the major late promoter from adenovirus 2. Such expression vectors will also 
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contain a set of RNA splice sites located downstream from the promoter and upstream from the insertion 
site for a gene encoding a protein having biological activity for blood coagulation. Preferred RNA splice site 
sequences may be obtained from adenovirus and/or immunoglobulin genes. Also contained in the expres- 
sion vectors is a polyadenylation signal, located downstream of the insertion site. Viral polyadenylation 
5 signals are preferred, such as the early or late polyadenylation signals from SV40 or the polyadenylation 
signal from the adenovirus 5:Elb region. In a particularly preferred embodiment, the expression vector also 
comprises a viral leader sequence, such as the adenovirus 2 tripartite leader, located between the promoter 
and the RNA splice sites. Preferred vectors may also include enhancer sequences, such as the SV40 
enhancer. 

70 Cloned DNA sequences may then be introduced into cultured mammalain cells by calcium phosphate 
mediated transfection. (Wigler et al„ Cell 14: 725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7: 
603, 1981; Graham and Van der EbrvTrology 52: 456, 1973.) A precipitate is formed of the DNA and 
calcium phosphate and this precipitate is appliedlo the cells. A portion of the cells take up the DNA and 
maintain it inside the cell for several days. A small fraction of the cells (typically 10~ 4 ) stably integrate the 

75 DNA into the genome. In order to identify these stable integrants, a gene that confers a selectable 
phenotype (a selectable marker) is generally introduced along with the gene of interest. Preferred selectable 
markers include genes that confer resistance to drugs, such as G-418 and methotrexate. Selectable 
markers may be introduced into the cell on a separate plasmid at the same time as the gene of interest or 
they may be introduced on the same plasmid. A preferred selectable marker is the gene for resistance to 

20 the drug G-418, which is carried on the plasmid pKO-neo (Southern and Berg, J. Mol. Appl. Genet. 1: 327- 
341, 1982). It may also be advantageous to add additional DNA, known as "carrier DNA," to the mixture 
which is introduced into the cells. After the cells have taken up the DNA, they are allowed to grow for a 
period of time,, typically 1-2 days, to begin expressing the gene of interest. Drug selection is then applied 
to select for the growth of cells which are expressing the selectable marker in a stable fashion. Clones of 

25 such cells may be screened for expression of the protein of interest. 

Factor VII produced by the transfected cells may be removed from the cell culture media by adsorption 
to barium citrate. Spent medium is mixed with sodium citrate and barium chloride and the precipitate 
collected. The precipitated material may then be assayed for the presence of the appropriate clotting factor. 
Further purification may be achieved through immunoadsorption. It is preferred that the immunoadsorption 

30 column comprise a high-specificity monoclonal antibody. Alternatively, purification of the barium citrate 
precipitated material may be accomplished by more conventional biochemical methods or by high- 
performance liquid chromatography. 

Conversion of single-chain Factor VII to active two-chain Factor Vila may be achieved using Factor Xlla 
as described by Hedner and Kisiel (J. Clin. Invest. 71: 1836-1841, 1983), or with other proteases having 

35 trypsin-like specificity (Kisiel and Fujikawa, Behring Inst Mitt. 73: 29-42, 1983). 

In summary, the present invention provides a method forThe production of proteins having the activity 
of Factor Vila using transfected mammalian cells. Gene sequences encoding the specific serine protease 
domain of Factor Vila are isolated from cDNA libraries. Sequences encoding the leader peptide and calcium 
binding domains are isolated from cDNA or genomic libraries or constructed from synthesized 

40 oligonucleotides. The sequences are then joined in an appropriate expression vector so as to encode Factor 
VII. The resulting vector and a plasmid containing a drug resistance marker are co-transfected into 
appropriate mammalian tissue culture cells. Transfected cells may then be selected by addition of the 
appropriate drug, such as G-418. The protein products are then purified from the cell growth media and 
assayed for biological activity in a blood coagulation assay and for immunological cross-reactivity using 

45 antibodies prepared against authentic human Factor VII. 

To summarize the examples which follow, Example 1 discloses the cloning of a full-length cDNA 
sequence for Factor VII. Example 2 discloses a partial amino acid sequence of human Factor VII, including 
the sequence of approximately 30 amino acids at the amino terminus. Example 3 discloses the construction 
and screening of a human genomic DNA library and the identification of genomic clones comprising Factor 

so VII gene sequences. Example 4 discloses the construction of two hybrid gene segments, each comprising a 
cDNA fragment encoding the leader peptide of Factor IX and a synthesized double-stranded fragment 
encoding a consensus calcium binding domain. The hybrid sequences are then joined to partial cDNA 
clones of Factor VII. Using in vitro mutagenesis, the consensus sequence was then altered to conform to 
the protein sequence data lor Factor VII. Example 5 describes the construction of a gene sequence 

55 encoding a fusion protein comprising the calcium binding domain of Factor IX and the specific serine 
protease domain of Factor VII. Example 6 describes the construction of the vector pD2 for use in 
expressing proteins having biological activity for blood coagulation in transfected mammalian cells. The 
gene fusion described in Example 5 is expressed using this vector. Example 7 describes the use of the 



8 



EP 0 200 421 B1 



vector pD2 to express a gene for Factor IX in a transfected mammalian cell line. Example 8 describes the 
construction of the vector pM7135, which contains DNA sequences encoding a primary translation product 
comprising the leader sequence of Factor IX fused to Factor VII. This vector may be used to produce a 
protein having the activity of Factor VII in a transfected mammalian cell line. Example 9 describes the 
5 expression of Factor VII using cDNA sequences, and the expression of Factor VII from a genomic-cDNA 
hybrid sequence. 

The following examples are offered by way of illustration and not by way of limitation. 
EXAMPLES 

10 

Restriction enzymes were obtained from Bethesda Research Laboratories (BRL) and New England 
Biolabs and were used as directed by the manufacturer, unless otherwise noted. Oligonucleotides were 
synthesized on an Applied Biosystems Model 380 A DNA synthesizer and purified by polyacrylamide gel 
electrophoresis on denaturing gels. E. coli cells were transformed as described by Maniatis et al. (Molecular 
75 Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, 1982). M13 and pUC cloning vectors and 
host strains were obtained from BRL. Factor VII was prepared from human plasma as described by Kisiel 
and McMullen (ibid). 

Example 1: Cloning of a Partial Factor VII cDNA. 

20 ————— — 

A. Construction of a human liver cDNA library. 

A cDNA library was prepared from human liver mRNA by the method of Chandra et al., Proc. Natl. 
Acad. Sci. U.S.A.80: 1845-1848, 1983. The cDNA preparation was sedimented through an alkaline sucrose 

25 gradient (Monahan et al., Biochemistry 15: 223-233, 1976) and fractions containing species of greater than 
about 1000 nucleotides were pooled. The" first strand preparation was made double-stranded using reverse 
transcriptase (Chandra et al., 1983), treated with S1 nuclease, and the residual staggered ends filled-in 
using DNA Polymerase I (Klenow fragment) in the presence of all four deoxyribonucleotide triphosphates 
(Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 1982). The blunt- 

30 ended cDNA was treated with Eco Rl methylase and ligated to phosphorylated Eco Rl linkers using T* DNA 
ligase (Maniatis et al., ibid). The ligated DNA preparation was exhaustively digested with Eco Rl to remove 
excess linker sequences and double-stranded DNAs greater than about 1000 base pairs in length were 
purified by neutral sucrose gradient centrifugation (Maniatis et al., ibid). Native Xgt1 1 DNA was ligated into 
concatemers, digested to completion with Eco Rl, and the 5* terminal phosphates were removed by 

35 treatment with bacterial alkaline phosphatase. The pooled human liver cDNA was ligated with the phage 
DNA, packaged in vitro (Maniatis et al., ibid), and used to infect E. coli Y1088 (Young and Davis, Science, 
222 : 778-782, 1983). Approximately 14 x 10 6 primary phage "plaques were generated in this library, 
composed of seven libraries of ~ 2 x 10 6 plaques each. Greater than 90% of these were recombinants 
containing human DNA inserts, based on their lack of 0-galactosidase activity and characterization of 20 

40 random clones by Eco Rl digestion followed by agarose gel electrophoresis. The cDNA library, in the form 
of phage particles, was purified by cesium chloride gradient centrifugation and stored in SM buffer (Maniatis 
et al., ibid). 

B. Screening of the human liver cDNA library for Factor VII clones. 

45 

The human liver expression cDNA library described above was screened for specific antigen (Young 
and Davis, ibid) using an 125 Mabeled monoclonal Factor VII antibody prepared by the method of Brown et 
al. ( J. Biol. Chem. 225 : 4980-4983, 1980) using purified Factor VII. Screening of 6 x 10 s phage plaques 
identified one isolate, designated XVII2115, which gave a positive response with the antibody, 
so The phage clone XVII2115 was tested against two other anti-Factor VII monoclonal antibodies and a 
rabbit polyclonal antibody to Factor VII. Isolate XVII21 15 gave a positive response to all these anti-Factor VII 
antibodies. 

DNA was prepared from a plate lysate (Maniatis et al., pp. 65-66, 1982) of XVII2115. Digestion of this 
DNA with Eco Rl liberated an insert of 2139 base pairs. This insert was subcloned into M13 phage vectors 
55 (Messing, Meth. in Enzymology 101 : 20-77, 1983; and Norrander et al., Gene 26: 101-106, 1983) for chain 
termination dideoxy DNA sequencing (Sanger et al., Proc. Natl. Acad. Sci. LLS.A. 74: 5463-5476, 1977). 
This cDNA insert contains Pst I sites at positions 214, 839, and 1205 (designated Psfla, Pst lb, and Pst Ic, 
respectively, in Figure 1a) and a Sma I site located at position 611. The following M13 templates were 
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sequenced: 

1) full-length (2139 bases) Eco Rla — Eco Rib fragment in M13mp18 (designated clone F7-1); 

2) Pst la — Eco Rla 214 base fragment in M13mp19 (F7-2); 

3) Pst la — Pst lb 625 base fragment in M13mp18 (F7-3); 
5 4) Pst lb — Pst la 625 base fragment in M13mp18 (F7-7); 

5) Sma I — Pst lb 228 base fragment in M13mp10 (F7-8); 

6) Pst lb — Pst Ic 366 base fragment in M13mp18 (F7-9); 

7) Pst Ic — Pst lb 366 base fragment in M13mp18 (F7-10); 

8) Pst Ic — Eco Rib 930 base fragment in M13mp19 (F7-11); and 
w 9) Eco Rib — Eco Rla full-length fragment in M13mp18 (F7-12) 

(restriction site designations refer to Figure 1a). 
The data confirmed the sequence on both strands for 91% of the coding region and 15% of the 3' non- 
coding region and yielded single-stranded sequence information for the remaining 9% of the coding region 
and 85% of the non-coding region. 

75 Comparison of the amino acid sequence predicted from the cDNA sequence with the known amino acid 
sequence data of Kisiel and McMullen (Thrombosis Research 22: 375, 1981) and the amino acid sequence 
shown below (Example 2) revealed an anomaly which could be explained by the absence of three 
nucleotides in the DNA sequence near position 400. To obtain additional sequence data, XVII2115 was 
digested with Eco Rl, and the Factor VII coding fragment was inserted into pUC 13 (Vieira and Messing, 

20 Gene 19: 259-268, 1982; and Messing, ibid) which had been digested with Eco Rl. The resultant 
recombinant plasmid, designated pUCVII2115, was digested with Xba I which cut at position 328. The 
digested sample was divided in half: half was labeled with a 32 P dCTP and DNA Polymerase I (Klenow 
fragment) (Englund, P.T., J. Mol. Bio . 66: 209, 1972); the other half was labeled with 7 32 P ATP and 
polynucleotide kinase (Chaconas et al.r"Biochem. Biophys. Res. Comm. 66: 962, 1975). The labeled 

25 plasmids were then recut with Pst I to yield 113 and 509 base pair fragments. "Both strands of each of these 
were sequenced by the method of Maxam and Gilbert (Meth. in Enzymology 74: 560, 1980). The 113 base 
pair fragment was sequenced in its entirety and 210 base pairs of the 509 base pair fragment were 
sequenced. These sequences revealed three additional bases (one C and two G's) which rendered the DNA 
sequence data in agreement with the protein sequence data, indicating that the previous anomalous results 

30 arose from compressions on the sequencing gel due to secondary structure involving G's and C's. The 
sequence of the last 9% of the coding region on both strands was also confirmed. 

Further analysis of the sequence of the pUCVII2115 insert confirmed that a portion of this cloned 
fragment encoded a sequence of 11 amino acids known to be at the cleavage site of Factor VII (Kisiel and 
McMullen, Thrombosis Research 22: 375, 1981). Comparison of this sequence to Factor IX (Davie et al., 

35 ibid) and Factor X (Leytus et al., Proc. Natl. Acad. Sci. U.S.A. 81: 3699-3702, 1984) amino acid sequences 
suggested that the clone contained the sequence for Factor VII beginning at (approximately) nucleotides 
coding for amino acid 36 of the mature Factor VII protein and continuing through approximately 1000 
coding and 1100 noncoding nucleotides and poly A sequence. In addition, it was found that this clone had 
frameshift mutations in the 3* coding portion. 

40 In order to obtain the correct 3* coding region, all 14 million clones of the seven Xgt11 cDNA libraries 
were screened by plaque hybridization (Benton and David, Science 196: 180-181, 1977) with nick-translated 
cDNA of XVII2115 (Maniatis et al., pp. 109-112, 1982). " 

Seven positive isolates were then screened by dideoxy sequencing of pUC plasmids into which the 
cDNA inserts had been subcloned (Wallace et al., Gene 16: 21, 1981). The \gt11 clones were digested with 

45 Eco Rl and the Factor VII fragments were inserted into~pUCl3 which had been cleaved with Eco Rl. All 
except one of these were found to start at a position corresponding to base 212 of the insert in XVII2115; 
the one exception consisted only of 3' non-coding sequence. One of the clones starting at base 212 was 
selected for analysis and was designated clone pUCVII1923. 

Because analysis of pUCVII2115 indicated the presence of frameshift mutations between positions 657 

so and 815, pUCVII1923 was first analyzed in this region by Maxam-Gilbert sequencing. Plasmid pUCVII1923 
was digested with Nar I (position 779 in Figure 1a). The cut DNA was labeled with a 32 P dCTP using DNA 
polymerase I (Klenow fragment) and subsequently digested with Ava I (which cleaves at the same site as 
Sma I in Figure I) and Taq I (site at 1059), yielding a Nar l-Ava I 166 bp fragment and a 200 bp Nar l-Taq I 
fragment. Each of these was sequenced. A C, missing in pUCVII2115, was found at position 697 and 

55 another C, also missing in pUCVII2115, was found at postition 798. 

The rest of the sequence of the coding region of pUCVII1923 was shown to be correct by sequencing 
by the dideoxy method on an M13 subclone of the entire insert of pUCVII1923. The Lac primer ZC87 (Table 
1) was used to sequence from position 212 (Figure 1a) to 512; primer ZC218 (CTCTGCCTGCCGAAC) was 
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used to sequence from 715 to 1140 and primer ZC217 (ATGAGAAGCGCACGAAG) was used for sequen- 
cing from 720 to 350. Since the pUCVII2115 insert is correct from position 13 (positions 1-12 include an 
artificial linker) to 695, and pUCVII1923 is correct from position 212 to the end, the two were spliced 
together to yield a molecule correct from position 13 (Figure 1a) to the end. A convenient point utilized for 
5 this splice is the Xba I site at position 328. The sequence of the spliced corrected molecule is shown in 
Figure 1a. 

Because a full-length Factor VII clone was difficult to obtain by cDNA cloning, three strategies were 
adopted to provide the missing coding sequence and the necessary upstream processing and signal 
sequences. The first strategy was to obtain the needed sequence from a human genomic DNA library or 

w through additional screening of cDNA libraries. The second approach was to synthesize the necessary 5' 
coding sequence, based on the amino acid sequence data for Factor VII (Example 2) and the published 
sequences of the genes encoding vitamin K-dependent clotting factors (Kurachi and Davie, ibid; and Davie 
et al., ibid), and join this to a portion of the prepro sequence of Factor IX. The third strategy relies on the 
functional homology of the amino terminal regions of Factor VII and Factor IX. A sequence was constructed 

75 which comprised the coding regions for the leader and amino-terminal portion of Factor IX. This was then 
fused in the proper orientation to the partial Factor VII cDNA. 

In order to obtain DNA sequences that comprise the entire DNA sequence of Factor VII, an attempt was 
made to isolate the remaining 5* DNA sequence. This was accomplished through the utilization of the 5* 
terminal 0.3 kb EcoRI-Xbal fragment from the cDNA insert of XVII2115 to screen a cDNA library comprising 

20 2 x 10 6 phage. The library was constructed using poly(A) mRNA from HepG2 cells following an adaptation 
of the method of Gubler and Hoffman (Gene 25: 263-269, 1983). The RNA was reverse transcribed to 
generate first strand cDNA, followed by seconcTstrand synthesis using DNA polymerase I and RNase H. 
Following EcoRI methyiation and passage over a Sepharose® 6B column, the DNA termini were blunted 
with T* DNA polymerase. EcoRI linkers were added and excess linkers were removed by digestion with 

25 EcoRI and chromatography on Sepharose® CL 2B. The DNA in the void volume was collected and ligated 
to Xgt1 1 which had been digested with EcoRI and treated with calf intestinal phosphatase. The DNA was 
packaged and infected into E. coli Y1088. Several positives were detected, and the EcoRI fragments were 
subsequently subcloned into" M13 phage vectors for dideoxy sequencing using either the M13 universal 
primer or Factor VII specific oligonucleotides. 

30 From these, three new cDNA clones of Factor VII were obtained, and their sequences completely 
determined. The largest of these cDNAs, from a clone designated XVII2463, was found to contain the entire 
coding sequence for Factor VII. This clone included a 35 nucleotide 5' untranslated region, 180 nucleotides 
coding for a 60 amino acid leader, 1218 nucleotides coding for the 406 amino acid mature protein, a stop 
codon, 1026 nucleotides of 3' untranslated sequence, and a 20 base poly(A) tail (beginning at position 

35 2463). This cDNA has now been sequenced in its entirety on both strands. A comparison of it with two 
cDNAs isolated earlier from clones XVII2115 and XVII1923 revealed that clone XVII2463 contains an 
additional 321 nucleotides upstream of the insert in XVII2115 and 519 nucleotides upstream of the insert in 
XVI11923. The overlapping Factor VII sequences of XVII2463 and these two previous cDNAs agree, except 
that the cDNA of XVII2463 does not contain single base deletions at positions 1005 and 1106, which were 

40 detected in the cDNA of XVII2115. Thus, XVII2463 contains, on a single EcoRI fragment, a Factor VII cDNA 
coding for Factor VII leader and mature protein sequences. 

An additional cDNA, XVII565, was isolated and found to contain 5* terminal Factor VII sequences, but 
was truncated within the coding sequences. Its 5' end maps at nucleotide 9 (Figure 1 b). 

When compared with full-length XVII2463, XVII565 was found to lack a sequence corresponding to one 

45 exon-like region within the leader sequence. Bases 100-165 are absent from XVII565 (Figure 1b). The 
absent sequences correspond precisely to one exon-like region by comparison with genomic sequence data 
(as described in Example III). Thus, the XVII565 structure may be a consequence of alternative splicing 
events in the leader sequence. 

The leader encoded by XVII2463 is exceptionally long (60 amino acids) and has a very different 

so hydrophobicity profile when compared with Factor IX, protein C and prothrombin. This leader contains two 
Mets, at positions -60 and -26. Initiation most likely begins at the first Met, since a hydrophobic region, 
typical of signal peptides, follows the Met at position -60, but not the Met at -26. It is interesting that the 
absent sequences in XVII565, which corresponds precisely to an exon-like region in the genomic clone, 
results in a 38 amino acid leader with a hydrophobicity pattern more analogous to the above proteins. 

55 
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Example 2: Animo Acid Sequence of Human Factor VII . 

The elucidation of the amino acid sequence of human Factor VII was desired In order to confirm the 
identity of putative cDNA clones, substantiate the sequence of Factor VII cDNA, provide information allowing 
5 for the synthesis of specific oligonucleotide probes to screen cDNA and genomic libraries for clones 
containing the 5' sequence, and to construct a synthetic fragment encoding the ami no-terminal portion of 
Factor VII. Although limited amino acid sequence was provided by Kisiel and McMullen (ibid), more 
information was needed. 

Purified human Factor Vila (Kisiel and McMullen, ibid) was reduced and carboxymethylated by the 

w method of Crestfield et al., J. Biol. Chem. 238 : 622, 1963. The light and heavy polypeptide chains of 
carboxymethylated Factor Vila were separatecTby high-performance liquid chromatography (HPLC) on a 
Micro Pak C18 reverse phase column (Varian Corp.) by generating a gradient of 0.1% TFA in distilled water 
(A) and 0.1% TFA in acetonitrile (B) from 0-40% B in 5 minutes, 40-80% B in 25 minutes and 80-100% B in 
5 minutes. Approximately 300 picomoles of each peptide chain were analyzed by automated Edman 

15 degradation using a Gas-Phase Protein Sequencer (Applied Biosystems, Inc.). Eighteen and 29 residues 
were identified at the amino-termini of the heavy and light polypeptide chains, respectively. The amino- 
terminal sequence of the heavy chain of Factor Vila was consistent with that encoded by cDNA clone 
pUCVII2115 (Figure 2b). Amino acid residues are designated within Figures 2a and 2b by single letter code 
as follows: A, alanine; C, cysteine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, glycine; H, 

20 histidine; I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine; P, proline; Q, glutamine; R, 
arginine; S, serine; T, threonine; V, valine; W, tryptophan; Y, tyrosine; X indicates an unknown residue and * 
indicates that the Gla residues (y) were assigned by homology to the structures of other known clotting 
factors and by the absence of any other phenylthiohydantoin-amino acid at those positions. The gaps (-) are 
placed to provide the best alignment among the sequences. In addition, the information indicated that the 

25 amino acids at positions five and nine were lysines and not threonine and arginine, respectively, as 
previously reported (Kisiel and McMullen, ibid). The sequence analyses of the light chain of Factor Vila, 
which originates from the amino-terminal region of Factor VII, fell short by approximately 6 residues to 
overlap with the structure encoded by the 5' end of cDNA clone pUCVII2115. 

To obtain additional sequence data, two nanomoles of the carboxymethylated light chain were digested 

30 for 12 hours by bovine chymotrypsin (1:100 w/w, enzyme: substrate) in 0.1 M ammonium bicarbonate, pH 
7.8, at 37 °C. The generated fragments were purified by HPLC on a Micro Pak C18 reverse phase column 
using the above solvents in a gradient of 0-30% B in 5 minutes, 30-60% B in 25 minutes and 60-80% B in 
10 minutes. Peptides were identified by their U.V. absorption at 220 and 280 nm. Lyophilized peptides 
(approximately 1 nanomole each) were analyzed by Edman degradation. The results (Figure 2b) confirmed 

35 much of the cDNA sequence in the corresponding region of clone pUCVII2115. In total, 113 of 152 residues 
(75%) of the light peptide chain of Factor Vila were identified. This sequence is identical to that encoded by 
the known cDNA structure. Indirect evidence indicates Asn 145 is a site of carbohydrate attachment. 

Example 3: Cloning of the genomic Factor VII sequence. 

40 ~~ ' ~ ~ 

As one approach to providing the 5' end sequence lacking from the cDNA, a lambda phage library 
containing human fetal liver DNA (Lawn et al., Cell 15: 1157-1174) was screened with nick translated Factor 
VII cDNA. A portion of the genomic library was plated on E. coli LE392 (ATCC 33572) to produce a total of 
7.2 x 10 6 plaques (Maniatis et al., ibid, pp. 320-321). The phage plaques were adsorbed from the plates 

45 onto nitrocellulose and hybridized with the 32 P-labeled cDNA according to the procedure of Benton and 
Davis (Science 196: 180, 1977). Eight clones were obtained and plaque purified. 

Using a DNATragment (Eco Rla-Xba I, Figure 1) from the 5' end of the Factor VII cDNA (XVII2115) and 
standard techniques (Maniatis et al., ibid) those genomic clones containing 5' end sequences were 
identified. These phage were designated 7m1, 7m2 and 7m3. DNA was prepared from these recombinant 

50 phage and preliminary restriction endonuclease maps derived. Phage 7m1, which gave the strongest 
hybridization signal, was used to generate a more extensive restriction map and to place the Eco Rl-Xba I 
cDNA sequences on this map by Southern blotting (Southern, J. Mol. Biol. 98: 503, 1975). 

In order to determine if phage 7m 1 contained the DNA sequences encoding the amino terminal amino 
acids of the Factor VII protein, Southern blots of phage DNA restriction digests were hybridized with 

55 mixtures of oligonucleotides whose sequences were deduced from the Factor VII amino terminal amino acid 
sequence. Oligonucleotides ZC188, ZC360, and ZC401 (Table 1) were radioactively labeled with T 4 
polynucleotide kinase and hybridized to the phage DNA blots at a few degrees centigrade below their Tm 
(Wallace, R. B., et al., Nuc. Acids Res. 6: 3543-3557, 1979). The results of this analysis indicated that a 3.7 
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kb Sst I fragment of 7m1 contained sequences hybridizing to these oligonucleotides. This Sst I fragment 
was subcloned into M13 for DNA sequence analysis. Results obtained using ZC360 as sequencing primer 
identified a region approximately 60 nucleotides in length, which corresponded to the amino-terminal protein 
sequence data. 



Oligonucleotide Sequence 

ZC87 TCC CAG TCA CGA CGT 

T G A 

ZC188 GCC GGG CTCA CTC CTC CA GAA GGC GTTGG 

C A G 



ZC212 


GAC 
TCA 


CTG 
TGG 


CAG 


GAT 


CCA 


TGC 


AGC 


GCG 


TGA 


ACA 


TGA 


ZC213 


GAG 
GCG 


GCC 
CTG 


TGG 


TGA 


TTC 


TGC 


CAT 


GAT 


CAT 


GTT 


CAC 


ZC217 


ATG 


AGA 


AGC 


GCA 


CGA 


AG 












ZC218 


CTC 


TGC 


CTG 


CCG 


AAC 














ZC235 


GAT 


CCA 


TGC 


AGC 


GC 














ZC249 


AGA 


ACA 


GCT 


TTG 


TTC 


TTT 


CA 










ZC275 


GCC 


CCC 


ATT 


CTG 


GCA 














ZC286 


CCA 
CCT 


AAG 
GGG 


AGG 
AGC 


GCC 
CTG 


AAC 
GAG 


GCC 
AGA 


TTC 
GAG 


CTG 
TGT 


GAG 
ATT 


GAG 
GAG 


AGA 
G 


ZC287 


AAT 
CTC 


ACA 
CAG 


CTC 
GAA 


TCT 
GGC 


CTC 
GTT 


CAG 
GGC 


GCT 
CCT 


CCC 
CTT 


AGG 
TGG 


TCT 


CTC 


ZC288 


AGC 
TCG 


AGT 
AGG 


GTA 
CCA 


GCT 
GCG 


TCG 
ACG 


AGG 


AGA 


ACA 


GAG 


AGG 


TTT 


ZC289 


AAT 
TCC 


TCG 
TCG 


TCG 
AAG 


CTG 
CTA 


GCC 
CAC 


TCG 
TGC 


AAA 
TCC 


ACC 


TCT 


CTG 


TTC 


ZC333 


CAG 


CTT 


CGT 


CCT 


GTC 


GCT 


GGC 


CTC 
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70 



25 



2C336 CCT CTT TGG GCC TGG TGA 

C C C C G 
2C360 CA TC TC TC TT CA 

T T T T A 

ZC401 CGT AGC GTT CAG GCC CTC GAA GAT CTC GCG GGC 

CTC CTC GAA GCT ACA C 



Since genomic clone 7m 1 was known to contain 7kb of sequences upstream of exon 2, this clone was 
anticipated to encode Factor VII S'-untranslated sequences and the leader sequences up to the amino acid 
75 position -17. In order to confirm that exon 1 was encoded within genomic clone 7 ml, the leader sequence 
information from clones XVII2463 and XVII565 was used to design oligonucleotides ZC528 and ZC529 
(shown below). 

20 ZC528 

5 'tCA ACA GGC AGG GGC AGC ACT GCA GAG ATT 3 1 



ZC529 

5 ' TTC CAC GGC ATG TCC CGT GTT TCT CCT CCT 3 ' 



These were used to probe 7m1 DNA, and a sublcone, 7SD, was found that hybridized to both 
30 oligonucleotides. Exon 1 was determined to be composed of two exonic sequences: exon 1a, which 
hybridized to ZC528 (corresponding to nucleotides 1 to 30 in XVII2463), and exon 1b, which hybridized to 
ZC528 (corresponding to nucleotides 119 to 148 in XVII2463). The intron sequences flanking both exons 1a 
and 1b have been sequenced: 1a contains a consensus splice donor sequence at the 3' end of the exon, 
and 1b is flanked on each terminus with a consensus splice acceptor (upstream of 1b) or donor 
35 (downstream of 1b) sequence. The position of exon 1a within genomic clone 7m1 has been precisely 
mapped, while that of exon 1b has been mapped within a defined region. Exon 1b sequences are present in 
XVII2463, while XVII565 appears to be derived from RNA spliced between exon 1a and exon 2, looping out 
the 1b exonic sequence. 

A variety of 7m1 subclones in pUC and M13 vectors were prepared to facilitate sequencing the 
40 remaining exons. Appropriate oligonucleotides designed from the cDNA sequences, which correspond to 
exons 1 through 7, were used to sequence all but the last exon. The genomic sequence corresponds 
exactly to the cDNA sequences through these regions. In addition, the intron/exon boundaries for exons 1-7 
have been determined, and most are now precisely mapped within clone 7m1. The intron sizes and 
positions within the Factor VII gene are listed in Table 2. 

45 



50 



55 
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Intron/Exon Junctions in the Factor VII Gene 



5 


X ntron 


rum llKJ nLXU rUbl uluil 


Intron Siz^ 


A 


ft 

-39 


N ft O 




B 


-17 


>1.0 


70 


C 


37/38 


1.92 




D 


46 


0.068 




E 


84 


~2 


75 


F 


131 

• 


-1 




G 


167/168 


0.56 




H 


209 


1.31 



20 

Phage 7m1 was known to lack the Factor VII 3'-terminus, which includes exon 8. In order to obtain 
these sequences, a 12-13 kb-enriched Bam HI library in XL47.1 (Loenen and Brammer, Gene 10: 249, 1980; 
Maniatis, et al., ibid.), derived from human dermal primary fibroblast cells, was probed with two nick- 
translated Factor VII cDNA Pstl fragments (corresponding to sequences in exon 7, and to 3'-untranslated 
25 sequences). A clone, designated 7DC1 , was detected by both probes. Subsequent restriction endonuclease 
and Southern blot analysis established that clone 7DC1 overlaps with, and extends approximately 3 Kb 
beyond the terminus of, clone 7m1, and that it contains exon 8. The 3.9 Kb (Xbal-BamHI) fragment from 
7DC1 DNA containing exon 8 was subcloned into M13, and sequence analysis was performed using 
oligonucleotides complementary to its 5* and 3' termini. The entire exon sequence is present in this clone. 

30 

Example 4: Factor IX-Factor VII Hybrid Genes Containing a Synthesized Coding Sequence. 



A. Construction of a hybrid Factor IX leader-synthetic Factor VII 5 f coding sequence. 

35 The second alternative for obtaining the 5' coding sequence for Factor VII was synthesis of an 
appropriate double-stranded fragment, using a nucleotide sequence predicted on the basis of the amino 
terminal amino acid sequence of Factor VII, the amino acid sequences of other vitamin K-dependent clotting 
factors, and the known nucleotide sequences of other vitamin K-dependent clotting factor genes (Kurachi 
and Davie, ibid; Anson et al., EMBO J. 3: 1053-1060,1984; and Davie et al., ibid). In order to provide the 

40 necessary secretion and processing signals for secretion of a mature Factor VII analog, this synthetic 
fragment (the consensus sequence) was joined to one of two leader sequences derived from a Factor IX 
cDNA clone. This strategy is outlined in Figure 3. 

A cDNA coding for human Factor IX was obtained from a library made with mRNA from human liver 
(Kurachi and Davie, ibid). The Factor IX sequence was isolated from the pBR322 vector by digestion with 

45 Pst I and was inserted into the Pst I site of pUCl3. This plasmid was designated FIX-pUC13. In order to 
remove the G-rich region which was present at the 5' end of the Factor IX insert as a result of cDNA 
cloning, a synthetic oligonucleotide adaptor was substituted for the 5' end of the cloned fragment. 
Oligonucleotides ZC212 and ZC213 (Table 1) were synthesized and annealed to generate a 22 base pair 
overlap, the fragment ends filled in and cut with appropriate restriction endonucleases, and the resulting 

50 fragment was joined to the Factor IX sequence. 

To construct the adaptor, 100 pmoles each of ZC212 and ZC213 were lyophilized and resuspended in 
10 ul of 10x kinase/ligase buffer (600 mM Tris pH 8.0, 100 mM MgCI 2 , 100 nM DTT) plus 86 ul H 2 0. The 
annealing reaction was run at 65 °C for 10 minutes, the mixture was slowly cooled to room temperature and 
put on ice. To this mixture was added 4 ul of 2.5 mM dNTP mix and 1 ul (8 units) T* DNA polymerase. The 

55 reaction was allowed to proceed 45 minutes at 14 W C. Ten ul of 5 M NH 4 OAc was then added and the DNA 
was extracted once with phenol/CHCb, twice with CHCb, and was precipitated with ethanol. The DNA was 
centrifuged and resuspended in 100 ul medium salt buffer (Maniatis et al., ibid, p. 100), digested with 9 
units Pst I and 8 units Cfo I, and extracted as above. 
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The modified Factor IX sequence was then constructed by combining 0.16 pmoles of the synthetic Pst 
l-Cfo I adaptor fragment, 0.14 pmoles of a 1.4 kb Cfo l-Bam HI Factor IX fragment from FIX-pUC13, and 
0.14 pmoles of a 2.7 kb Bam Hl-Pst I pUC13 vector fragment in a 20 ul reaction containing 60 mM Tris-HCI 
pH 7.5, 10 mM MgCk, 10 mM DTT, and 0.9 units T4 ligase. The reaction was incubated for 3 h at room 

5 temperature and used to transform competent E. coli JM83 (Messing, Recombinant DNA Technical Bulletin, 
NIH Publication No. 79-99, 2, No. 2, 43-48, 1979).~The cells were plated with 50 ul of 2% X-gal (5 bromo-4- 
chloro-3 idolyl-/8-D-galactoside) on L-broth containing 40 ug/ml. ampicillin and incubated at 37 'C overnight. 
White colonies were picked onto another plate containing ampicillin and grown at 37 °C overnight. The 
colonies were blotted on Whatman 540 paper and the paper prepared for hybridization according to the 

70 method of Wallace et al. (Gene 16: 21, 1981), except the overnight incubation on chloramphenicol plates 
was omitted. The papers were incubated at 44 *C for 2 h in 0.9 M NaCI, 0.09M Tris-HCI pH 7.5, 6 mM 
EDTA, 0.5% Nonidet P-40, 150 ug/ml E. coli tRNA. The papers were probed with 32 P-labeled ZC235 (Table 
1), a 14-mer that is specific for the altered 5' end sequence. Hybridization with 1-2 x 10 6 cpm per filter was 
carried out at 44 • C in the prehybridization buffer overnight. The filters were then washed 3 times in 6 x 

75 SSC, 0.1% SDS at 4* C and 3 times in 2 x SSC, 0.1% SDS at 44 "C and exposed to X-ray film. Two 
positive clones were obtained. One of these clones was designated FIX (-G) -» pUC13. 

In order to confirm the sequence of the altered region of the Factor IX portion of the FIX(-G) pUC13 
construct, dideoxy sequencing directly on the pUC plasmid using the BRL reverse primer was performed 
using the method of Wallace et al., 1981 (ibid) using a primer end labeled with polynucleotide kinase and 

20 7 32 P ATP by the method of Chaconas et al. (ibid). The sequence was as predicted. 

The resulting recombinant plasmid contains three Hae III cleavage sites, the first at position 39 in the 
Factor IX sequence (numbering is based on the published sequence of Anson et al. (ibid), beginning at the 
first ATG), the second at position 130, and a third in the pUCl3 polylinker. The site at 130 is a single base 
pair upstream from the codons for the Lys-Arg processing site of the prepo Factor IX molecule. In the final 

25 Factor IX-Factor VII hybrid constructs, the Factor IX leader sequence, terminated at position 39 or 130, was 
joined to a synthetic double-stranded fragment comprising the predicted consensus sequence and the last 3 
codons of the Factor IX leader sequence. 

The synthetic consensus fragment was produced by joining oligonucleotides ZC286-ZC289 (Table 1) to 
form a double-stranded fragment. One hundred pmole of each oligonucleotide was Jyophiiized and 

30 resuspended in 20 ul of 1x kinase buffer and incubated overnight at 4*C; then heated at 65 *C for 10 
minutes. Two pools were made using the kinased oligonucleotides. Pool 1 contained ZC286 + ZC287; pool 
2 contained ZC288 + ZC289. The pooled pairs were annealed 10 minutes at 65 *C, then cooled to room 
temperature over a period of 2 hours and placed on ice for 30 minutes. 

The modified Factor IX fragment was removed from FIX(-G) — pUC13 as a Hind lll-Eco Rl fragment. 

35 Approximately 20 ug of plasmid was digested with 30 units each of Hind III and Eco Rl in 100 ul Hind III 
buffer (BRL) containing 4 ug RNase A at 37* C overnight. The reaction was terminated by heating at 65 °C 
for 10 minutes, and the vector and Factor IX fragments were electrophoresed on a 1% agarose gel and 
purified by electro-elution. The Factor IX fragment was precipitated with ethanol, resuspended in buffer 
containing 400 ng/ul RNase A, and digested with 9 units of Hae III overnight at 37 *C. The Hind lll-Hae III 39 

40 base pair Factor IX fragment was isolated from this digest by electrophoresis on a 1.5% agarose gel 
followed by electro-elution. To obtain the Hind lll-Hae III 130 base pair Factor IX fragment, FIX-pUC13 was 
digested with Eco Rl and Hind III and the Factor IX fragment isolated as above. Approximately 3 ug of this 
Hind lll-Eco Rl fragment was digested with 6 units of Hae III at 37 °C and aliquots were removed at five 
minute intervals over 30 minutes into a solution containing 50 mm EDTA. The aliquots were pooled and the 

45 Hind lll-Hae III 130 base pair fragment was purified by electrophoresis on a 5% acrylamide gel followed by 
electro-elution. 

The final Factor IX-consensus sequence hybrids were prepared by joining, in a four-part ligation, 
oligonucleotide pools 1 and 2, Factor IX Hind lll-Hae III (39 or 130 base pairs), and pUC13 Hind lll-Eco Rl. 
The resulting plasmids were used to transform E. coli HB101 (ATCC 33694). Colonies were screened by 

50 digestion of DNA with Eco Rl and Hind III. The sequence comprising the 39 base pair Factor IX sequence 
joined to the synthetic consensus sequence is hereinafter referred to as mini-FIX-FVII. The plasmid 
containing this construct was designated pM7200(-C). The sequence comprising the 130 base pair Factor IX 
sequence joined to the synthetic consensus sequence is referred to as maxi-FIX-FVII. The plasmid 
containing this construct was designated pM7100(-C). The consensus sequence encodes a polypeptide 

55 comprising the amino acid sequence Ala-Asn-Ala-Phe-Leu-Gla-Gla-Arg-Pro-Gly-Ser-Leu-Gla-Arg-Gla-Cys- 
Lys-Gla-Gln-Cys-Ser-Phe-Gla-Gla-Ala-Arg-Gla-lle-Phe-Gla-Gly-Leu-Asn-Arg-Thr-Lys-Leu. 
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B. Joining Factor IX-consensus sequence hybrid fragment to Factor VII cDNA clone. 

The Factor IX-consensus sequence hybrids (either mini or maxi) were joined to the 5' portion of the 
Factor VII cDNA and the vector pUC13 in a three-part ligation (Figures 4 and 5). The vector fragment was 

5 produced by digesting 6 ug of pUC13 with 10 units each of Xba I and Hind III in Hind III buffer containing 
RNase A (400 ng/ul). The mini-FIX-FVII fragment was produced by digesting 2 ug of pM7200(-C) with 10 
units each of Hind III and Eco Rl as above. The maxi-FIX- FVII fragment was similarly prepared from 
pM7100(-C). The 5' portion of the Factor VII cDNA was prepared from a plasmid (pUCG705) comprising the 
Eco Rl-Xba I 5* fragment of pUCVII2115 subcloned into pUCl3 by digestion with Xba I and Eco Rl. Digests 

w were run at 37 *C for 2 hours and the products were separated by electrophoresis on a 1.5% agarose gel. 
The desired fragments were electro- eluted, extracted with phenol/CHCh and CHCfo, and precipitated with 
ethanol. The three fragments, pUCl3/Xba l-Hind III, Factor IX-Factor VII (mini or maxi)/Hind lll-Eco Rl, and 
5' Factor Vll/Eco Rl-Xba I were then ligated in 20 ul of ligase buffer containing 2 ul 20 mM ATP and 0.9 unit 
T4 DNA ligase overnight at 4*C. Colonies were screened by restriction analysis with Hind III and Xba I. The 

15 recombinant plasmids containing the mini- and maxi-FIX-FVII sequences were designated pM7200 and 
pM7100, respectively (Figure 4). 

Due to the linker addition used in producing the Factor VII cDNA, modifications had to be made in the 
fusion sequences to generate correct in frame coding sequences. Both mini- and maxi-fusions contain an 
Eco Rl site at the junction between the Factor IX-consensus sequence hybrid and the Factor VII cDNA 

20 which is an artifact of the cDNA cloning process. In addition, the mini-fusion requires the addition of a C to 
change the sequence at the Hae III site from 5 AGGCCA 3 to 5 AGGCCCA 3 ' and establish the correct reading 
frame downstream of this sequence. These corrections were made by oligonucleotide-directed site specific 
mutagenesis, essentially as described for the two-primer method by Zoller and Smith (Manual for Advanced 
Techniques in Molecular Cloning Course, Cold Spring Harbor Laboratory, 1983). The mini-FIX-FVII frag- 

25 ment was removed from pM7200 by digestion with Hind III and Xba I and inserted into M13mp19. The 
maxi-FIX-FVII fragment was purified from pM7100 and subcloned in a similar manner. The mutagenic 
primers ZC333 and ZC336 (see Table I) were used for removal of the Eco Rl site and the base insertion, 
respectively. In each case, the universal primer ZC87 was used as the second primer. The mutagenic 
primers were phosphorylated by combining 40 pmoles of primer and 60 pmoles ATP with 1 unit of T* DNA 

30 kinase overnight at 60 °C. To remove the Eco Rl site from the maxi-FIX-FVII hybrid, 1 ug of the M13 single- 
stranded template was combined with 20 pmoles each ZC333 and ZC87 in a total volume of 10 ul. The 
primers were annealed to the template for 10 minutes at 65 *C, cooled to room temperature for 5 minutes, 
then placed on ice for 5 minutes. The primers were extended using DNA polymerase I (Klenow fragment). 
To remove the Eco Rl site and correct the reading frame in the mini-FIX-FVII hybrid, 1 ug of the appropriate 

35 M13 single-stranded template was combined with 20 pmoles each ZC333, ZC336 and ZC87. Annealing and 
primer extension reactions were carried out as described above. Plaque lifts were screened with 32 P-labeled 
primer (ZC333 or ZC336) at 60 *C and sequences confirmed by dideoxy sequencing. The resultant 
constructs, comprising the maxi- and mini-FIX-FVII sequences, were designated pM7111 and pM7211, 
respectively. 

40 The consensus sequence contains several regions which do not conform to the protein sequence data 
obtained for Factor VII (Figure 2). In order to produce a sequence which encodes a polypeptide with greater 
homology to the amino-terminal portion of Factor VII, the consensus sequence was altered by 
oligonucleotide-directed site-specific mutagenesis. The changes made were the insertion of Leu at position 
8, substitution of lie for Lys at position 18 (numbers refer to the amino acid position after the inseertion at 

45 position 8), Asn for Ala at position 26, and the sequence Ala-Ser-Asp for Gly-Leu-Asn at positions 32-34 
(based on tentative amino acid sequence data). 

The sequence changes at positions 8 and 18 were made using pM7111 (sense strand) as template. 
Primers ZC352 ( 5> CCC AGG TCT CAG CTC CTC CAG 3 ) and ZC353 ( 5 CTG CTC CTC CTT ACA CTC 
TCT 3 ) were annealed to the template and extended as described above. The resultant phage clone was 

50 designated pM7114. The sequence of the insert in pM7114 was confirmed by dideoxy sequencing. 

In a similar manner, the changes at positions 26-34 were made on the pM7114 template (sense strand) 
using the mutagenic primer ZC366 ( 5 CAG CTT CGT CCT GTT CAG GCC CTC GAA GAT CTC GCG GGC 
CTC CTC GAA 3 ') and ZC87 (Table 1) as second primer. The resultant construct was designated pM7115. 
The sequence of the entire 550 bp insert in the M13 vector was determined by dideoxy sequencing and 

55 found to be correct. 
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Example 5: Construction of Factor IX-Factor VII cDNA fusion. 

The Factor IX-Factor VII cDNA fusion was prepared using Factor IX cDNA obtained from a human liver 
cDNA library as described by Kurachi and Davie (ibid) and the Factor VII cDNA sequence described in 
5 Example 1. 

The fusion point chosen for the hybrid protein was between amino acid +38 (threonine) of Factor IX 
and the first lysine encoded by the Factor VII cDNA sequence. Such a protein would be encoded by a 
sequence consisting of the first 252 bp of the Factor IX cDNA sequence and all of the pUCVII2115 Factor 
VII cDNA sequence except the first two codons. To construct this hybrid sequence, the Factor IX sequence 

10 was first fused to pUCVI!2115 using convenient restriction sites. This fusion resulted in the plasmid 
FIXA/ll/12 (described below) which contains the first 310 bp of the Factor IX cDNA joined to the entire 
Factor VII cDNA sequence. To achieve the precise junction desired for the hydrid protein, the intervening 
base pairs were removed by oligonucleotide-directed mutagenesis. 

Joining of the Factor IX cDNA sequence to the Factor VII cDNA sequence was accomplished by ligating 

75 a 0.3 kb Hind Ill-Ana III fragment of FIX (-G) — pUC13 (Example 4) to a 4.7 kb Sma l-Hind III fragment from 
pUCVII2115 (Figure 5). The Hind Ill-Ana III fragment was prepared by digesting 3 ug of FIX(-G) — pUC13 
with 40 units of Hind III in 40 ul of medium salt buffer (Maniatis et al., ibid) at 37 °C, 4 hours. The volume 
was then increased to 100 ul of medium salt buffer, and 5 units of Aha III were added and the 37 °C 
incubation continued for 18 hours. The DNA fragments were separated by electrophoresis in 1% agarose 

20 and the 0.3 kb band isolated as described above. A Sma I parital digestion of pUCVII2115 was obtained by 
incubating 3 ug of pUCVII2115 at 25 °C for 1 hour with 4.8 units of Sma I in a reaction volume of 30 ul. The 
reaction was stopped by a 15-minute incubation at 65 'C. The sample was then extracted once with an 
equal volume of phenol and ethanol precipitated. 

The precipitate was collected by a 10-minute microfuge spin, rinsed with 70% ethanol and air dried. 

25 The DNA was redissolved in 30 ul of medium salt buffer and digested with 30 units of Hind III at 37 *C for 3 
hours. The DNA was subjected to electrophoresis in 0.7% agarose and the 4.7 kb Hind Ill-Sma I fragment 
isolated as described above. Equimolar amounts of the two fragments (0.048 pmoles) were ligated in a 10 
ul reaction containing 50 mM Tris-HCI pH 7.5, 10 mM MgCfe, 1 mM DTT, 1 mM ATP, and 3 units of T 4 
DNA ligase at 14° C for 3.5 hours and then used to transform competent E. coli RRI (ATCC 31343). The 

30 cells were grown on ampicillin plates and 12 of the resulting colonies were screened by restriction enzyme 
digestion for the presence of the desired plasmid construction. DNA from colony 12 (FIXA/ll/12) gave the 
expected restriction enzyme digestion pattern and was used in the next step of the hybrid gene 
construction. 

The oligonucleotide-directed mutagenesis procedure was performed on a single-stranded DNA tern- 

35 plate. Thus, it was necessary to clone the fused Factor IX/Factor VII sequences into M13mp19. To obtain a 
conveniently small DNA fragment, a 640 bp Hind Ill-Xba I fragment was isolated from FIXA/ll/12. This 
fragment contains 310 bp of the 5' end of Factor IX cDNA and 330 bp of the Factor VII sequence. The 
vector was prepared by digesting 1 ug of M13mp19 RF DNA with 20 units of Hind III and 20 units of Xba I 
in 40 ul of medium salt buffer at 37 *C for 18 hours. The DNA was subjected to electrophoresis in 1.2% 

40 agarose and the linear 6.4 kb fragment isolated from the gel as described above. Five ug of FIX/VII/12 DNA 
was digested with 10 units of Xba I in 40 ul of medium salt buffer at 37* for 18 hours. Twenty units of Hind 
III were added and the digestion continued at 37 °C for an additional 7 hours. The resulting fragments were 
separated by electrophoresis in 1.2% agarose and the 640 bp fragment eluted as above. Ten ng of 
linearized m13mp19 and 1 ng of the 640 bp fragment were ligated at 14 P C for 1 hour and then used to 

45 transform competent E. coli JM101 (Messing, Meth. in Enzymology, ibid). The cells were plated with X-gal 
and IPTG (Messing, MethTTn Enzymology , ibid) and eight light blue plaques were picked and used to infect 
2.5 ml cultures of E. coli JM103 at Aeoo = 0.3. After 18 hours 1 growth at 37 °C, the cells were harvested by 
centrifugation in aToorrTtemperature clinical centrifuge and 20 ul of the supernatant which contains the M13 
phage was mixed with 10 ug/l ethidium bromide. By comparison with known standards, each of the eight 

so clones had an insert of approximatley the correct size. Single-stranded DNA was then prepared from 1.5 ml 
of the supernatants as described by Messing (Meth. in Enzymology, ibid). This construct was then 
sequenced by the dideoxy method using the oligonucleotide ZC87 as a primer to confirm that the insert 
junction was correct. One of the correct clones (#4) was used as a template in oligonucleotide-directed 
mutagenesis to produce a functional Factor IX-Factor VII fusion. 

55 The oligonucleotide ZC249, a 20-mer consisting of 10 bp of the desired Factor IX sequence and 10 bp 
of the desired Factor VII sequence (Table I) was used as the mutagenic primer. The oligonucleotide ZC87, 
which hybridizes to the M13mp19 sequence, was used as the second primer. 
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The mutagenesis procedure was modified from that of Zoller and Smith (ibid). For the annealing 
reaction, 20 pmoles of ZC249 were phosphorylated by incubating overnight at 4 m C in 20 ul 60 mM Tris-HCl 
pH 8.0, 10 mM MgCb, 1 mM DTT, 1mM ATP, 1 unit T* kinase. The reaction was stopped by incubation at 
65 *C for 15 minutes, and the sample was lyophlized. One pmole of single-stranded clone #4 template and 

5 20 pmole of ZC87 were added in 10 ul annealing buffer (200 mM Tris-HCl pH 7.5, 100 mM MgCfe, 500 mM 
NaCI, 10 mM DTT). The sample was heated to 65 *C for 10 minutes, incubated at room temperature for 5 
minutes, and then placed on ice. Ten ul of the following solution was prepared fresh and added to the 
sample: 20 mM Tris-HCl pH 7.5, 10 mM MgCfe, 10 mM DTT, 1 mM dNTPs, 1 mM ATP, 0.15 units/ul T 4 
DNA ligase, 0.25 units/ul E. coli DNA Polymerase I (Klenow fragment). The reaction was then incubated at 

10 15 *C for 3 hours and the _ sample used to transform competent E. coli JM101 (Messing, Meth. in- 
Enzymology, ibid). ~ " 

The resulting plaques were lifted onto nitrocellulose and screened by hybridization to 32 P-labeled 
ZC249. Dry BA85 filters (Schleicher & Schuell, 0.45 urn) were laid onto the agar plate and the phage 
allowed to adsorb for 5 minutes. The filters were removed and allowed to dry for 5 minutes, placed on 

75 Whatman 3 MM paper, saturated in 0.5 M NaOH, 1 .5 M NaCI for 5 minutes, air dried for 3 minutes, placed 
on Whatman paper, saturated in 1 M Tris-HCl pH 8, 1 .5 M NaCI, for 5 minutes, and air dried for 3 minutes. 
The Tris-HCl step was repeated and the filters were rinsed in 100 ml 6 x SSC for 2 minutes at room 
temperature. After air drying, the filters were baked at 80 *C for 2 hours and prehybridized at 47 *C (T m -4* 
of ZC249) overnight in 6.7 x SSC pH 6.5, 2 mg/ml E. coli tRNA, and 0.2% (w/v) each BSA, Ficoll, and 

20 polyvinylpryroiidine. 

After the prehybridization step, the filters were incubated with 2.5 x 10 6 cpm/filter of labeled ZC249 in 
the same SSC hybridization buffer at 47 "C overnight. Following hybridization, the filters were washed 3 
times, 5-10 minutes each, at room temperature in 6 x SSC and exposed to X-ray film. Putative positive 
plaques were replated and screened as above. Individual plaques were then picked, and single-stranded 

25 DNA was prepared and sequenced using ZC275 as a primer. The oligonucleotide ZC275 corresponds to a 
sequence 40 bp in the 5' direction of ZC249 on the same strand (Table 1). 

Four positive plaques were identified. The entire insert in M13mp19 for one clone (FIX/VII-9) was 
sequenced by the dideoxy method using the oligonucleotides ZC87 and ZC275 and determined to be 
correct. The confirmed sequence is represented by bases 1-567 in Figure 7. RF DNA from this clone was 

30 then used for the final step in the construction of the hybrid gene. 

Three fragments were used to make the final construction: the 0.6 kb Hind Ill-Xba I fragment from 
FIX/VII-9 containing the fused IX/VII sequences; a 1.7 kb Xba l-Bam HI Factor VII cDNA fragment from 
pUCVII1923; and a 2.7 kb Bam HJ-Hind III fragment of pUC13. Three ug of FIX/VII-9 (RF DNA) were 
digested at 37 ■ C for 6 hours with 45 units of Xba I in a volume of 50 ul. The DNA was precipitated with 

35 ethanol, resuspended and digested at 37 "C for 4 h with 50 units of Hind III. The sample was subjected to 
electrophoresis in 1% agarose and the 0.6 kb band electro-eluted from the paper with 1.5 M NaCI, 50 mM 
Tris-HCl pH 8, 1 mM EDTA, phenol extracted and precipitated with ethanol. 

To obtain the remaining Factor VII cDNA sequence, 5 ug of pUCVII1923 was digested at 37'C for 3 
hours with 36 units of Xba I in 40 ul of medium salt buffer. Then 8 ul of 10x high salt buffer, 28 ul of H20, 

40 and 4 ul (40 units) of Bam HI were added and the reaction incubated at 37 °C for 3 hours. The DNA 
fragments were separated by electrophoresis in 1 % agarose and the 1 .7 kb fragment isolated as described 
above. 

The vector fragment was prepared by digesting 1 ug of pUC13 with 10 units of Hind III in 20 ul of 
medium salt buffer at 37 *C for 1 hour. Two ul of 10x high salt buffer and 10 units of Bam HI were then 
45 added and the incubation continued for another 2 hours. The DNA was purified on a 1% agarose gel as 
described above. 

Equimolar amounts (approximately 0.56 pmoles) of the three fragments were ligated at room tempera- 
ture for 45 minutes in 10 ul of 50 mM Tris-HCl pH 7.5, 10 mM MgCI 2 , 1 mM DTT, 1mM ATP and 3 units T* 
DNA ligase. The reaction mixture was used to transform competent E. coli JM83. The cells were plated on 
50 medium containing 40 ug/ml ampicillin with 50 ul of 2% X-gal addedlo*iach plate. DNA was prepared from 
7 white colonies and then screened by restriction enzyme digestion. One of the clones giving the correct 
pattern was designated FIX/VII -> pUC13. 

Example 6: Expression of Biologically Active Factor VII Analogs . 

55 " " — — — ■ 

The mammalian cell expression vector pD2 was chosen for expression of the FIX/VII gene in 
transfected animal cells. It was constructed from plasmid pDHFR-lll (Berkner and Sharp, Nuc. Acids Res. 
13: 841-857, 1985) in the following manner. The Pst I site abutting the DHFR cDNA in pDHFR III was 
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converted to a Bam HI site by conventional linkering (Scheller, R.H., Dickerson, R.E., Boyer, H.W., Riggs, 
A.D., and Itakura, K., Science 196: 177-180, 1977). The pDHFR III DNA was incubated with 10 mM Tris pH 
7.6, 6 mM 0-MSH, 6 mM NaCTlO mM MgCI 2 and 2.5 units Pst I for 10 minutes at 37 °C, followed by 
phenol extraction and ethanol precipitation. The Pst I cohesive termini were blunt ended using T 4 DNA 

5 polymerase. After phenol extraction and dialysis against 10 mM Tris pH 8.0, 1 mM EDTA, 0.3 M NaCI, the 
DNA was ethanol precipitated. The DNA was resuspended in 20 ul 1.4 mM ATP, 50 mM Tris pH 7.6, 10 
mM MgCb, 1 mM dithiothreitol and then incubated with 5 ng of T 4 polynucleotide kinase-treated Bam HI 
linkers (New England Biolabs) and 200 units of T 4 polynucleotide ligase for 12 hours at 12 °C, followed by 
phenol extraction and ethanol precipitation. The DNA was digested with 90 units of Bam HI at 37 *C for 1 

10 hour, followed by electrophoresis through a 1 .4% agarose gel. The 4.9 kb DNA fragment (corresponding to 
pDHFR III DNA lacking the DHFR cDNA and SV40 polyadenylation signal) was electro-el uted and 
recircularized with polynucleotide ligase and then transfected into E. coli HB101. Ampicillin-sensitive 
colonies were screened by rapid prep analysis (Birnboim, H.C., and DolyTJ., Nucleic Acids Research 7: 
1513-1523, 1979) and the correct clone was grown up to generate a large-scale plasmid DNA preparation." 

75 The resultant plasmid was cleaved with 20 units Bam HI and treated with 2.5 ug calf intestinal 
phosphatase and electrophoresed on a 1.4% agarose gel. Twenty-five ug of pSV40 (a clone of SV40 DNA 
inserted into the Bam HI site of pBR322) were digested with 25 units of Bel I for 1 hour at 50 *C, followed 
by the addition of 25 units of Bam HI, and the incubation continued for 1 hour at 37 • C. This DNA was then 
electrophoresed on a 1.4% agarose gel. The Bam Hl-cut vector (i.e., that lacking the polyadenylation 

20 signal) was joined to the SV40 DNA fragment (.14 to .19 map units [Tooze, J., ed., "DNA Tumor Viruses, 
Molecular Biology of Tumor Viruses"]) containing the late polyadenylation signal by incubating the gel- 
purified fragments (0.1 ug each) in 20 ul 50 mM Tris pH 7.6, 10 mM MgCb, 1 mM dithiothreitol, 1.4 mM 
ATP and 100 units T 4 polynucleotide ligase for 4 hours at 12 °C, followed by transformation into E. coli 
RR1 . Positive colonies were identified by rapid prep analysis, and a large-scale plasmid preparation~of "the 

25 correct DNA, pD2, was prepared. 

To make the Factor IX/VII expression construction, 1 ug of pD2 was digested at 37 • C for 1 hour with 20 
units of Bam HI in 20 ul of high salt buffer. Twenty ul of 10 mM Tris-HCI pH 8, 1 mM EDTA and 0.1 unit of 
calf alkaline phosphatase (Boehringer) were then added. The reaction was incubated at 37 *C for 1 hour and 
stopped by heating to 75 *C for 10 minutes. Ten ug of FIXA/II pUCl3 was digested at 37 *C for 2 hours 

30 with 150 units of Bam HI in 150 ul of high salt buffer. The DNA fragments were separated by 
electrophoresis in 1.2% agarose and the 2.3 kb fragment was isolated. Equimolar amounts (0.015 pmoles) 
of the 2.3 kb Bam HI fragment and the pD2 vector fragment were ligated at 14 *C for 2.5 hours as above. 
The reaction mixture was used to transform E. coli RR1 cells, which were then plated on medium containing 
10 ug/ml ampicillin. Plasmid DNA was prepared from 12 of the resulting colonies and screened by 

35 restriction enzyme digestion. One of the clones with the correct enzyme digestion pattern was designated 
FIX/VII/pD2 (Figure 6). E. coli RR1 transformed with FIX/VII/pD2 has been deposited with ATCC under 
accession number 53068. 

The procedure used to transfect baby hamster kidney (BHK) cells (available from American Type 
Culture Collection, accession number CCL10) with FIXA/ll/pD2 was similar to published methods (for 

40 example, Wigler et al., Cell 14: 725, 1978; Corsaro and Pearson, Somatic Cell Genetics7: 603, 1981; 
Graham and Van der Et^Virology 52: 456, 1973). The BHK cells were grown at 37 'C, 5% CO2, in 
Dulbecco's media (plus 10% heat-inactivated fetal calf serum and supplemented with glutamine and 
penicillin-strep-tomycin) in 60 mm tissue culture Petri dishes to a confluency of 20%. A total of 10 ug DNA 
was used to transfect one 60 mm dish: 3.75 ug of FIXA/ll/pD2, 1.25 ug of pKO-neo (Southern and Berg, J. 

45 Mol. Appl. Genet V. 327-341, 1982) and 5 ug of salmon sperm DNA. The DNAs were precipitated in 0.3 M 
NaOAc, 75% ethanol, rinsed with 70% ethanol and redissolved in 20 ul 10 mM Tris-HCI pH 8, 1 mM EDTA. 
The DNA was combined with 440 ul H 2 0 and 500 ul of 280 mM NaCI, 1.5 mM NaHP0 4f 12 mM dextrose, 
50 mM HEPES pH 7.12. Sixty ul of 2 M CaCfe were added dropwise to the above mixture and the solution 
let stand at room temperature for 30 minutes. The solution was then added to the cells and the cells 

50 returned to 37 • C for 4 hours. The medium was removed and 5 ml of 20% DMSO in Dulbecco's with serum 
were added for 2 minutes at room temperature. The dish was then washed rapidly with 2 changes of 
medium and incubated in fresh medium overnight. Twenty-four hours after the DNA was added, the 
medium was removed and selective medium added (10 mg/ml of G418, 498 ug/mg, Gibco, in Dulbecco's 
with serum). After 10 and 13 days, individual clones, representing cells that had incorporated the pKO-neo 

55 gene and were thus resistant to G418, were transferred to 96-well (or 24-well) plates and grown up for 
protein assays. 

Cells were grown in Dulbecco's plus 10% fetal calf serum containing 5 ug/ml vitamin K (Phytonadione, 
Merck). The medium was separated from the cells and cellular debris by centrifugation, and assayed for 
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Factor VII polypeptide (by ELISA) and for biological activity. The cells were removed from the plates with 
trypsin, washed with fresh medium, centrifuged, and frozen at -20 °C. The cell pellets were then thawed in 
PBS, pelleted, and resuspended in PBS containing 0.25% Triton X-100. Samples were diluted and assayed 
for polypeptide and activity. 

5 The ELISA for Factor VII was done as follows. Two hundred microliters of a monoclonal antibody 
against human Factor VII (5 ul/ml in 0.1 M Na2C03 pH 9.6) were incubated in each well of a 96-well 
microtiter plate 2 hours at 37 °C. The wells were then incubated with 220 ul of 1% bovine serum albumin 
(BSA) and 0.05% Tween 20 in PBS pH 7.2 2 hours at 37 *C. The plates were rinsed with H 2 0, air dried, 
and stored at 4°C. To assay samples, 200 ul samples were incubated 1 hour at room temperature in the 

w antibody-coated wells. The wells were then rinsed four times with 200 ul PBS containing 0.05% Tween 20. 
The wells were then incubated for 1 hour at room temperature with 200 ul of an IgG fraction of rabbit 
polyclonal antiserum against Factor VII (5 ug/ml in PBS containing 1% BSA and 0.05% Tween 20). This 
was followed by incubation with goat anti-rabbit IgG coupled to alkaline phosphatase. The wells were then 
rinsed four times with PBS containing 0.05% Tween 20. To the wells were added 200 ul p-nitrophenyl 

75 phosphate (30 mg) dissolved in diethanolamine buffer (96 ml per liter) pH 9.8 containing 56 mg/l MgCb. 
The enzyme reaction was done at 37 ' C and the development of a yellow color was monitored at 405 nm 
using an ELISA plate reader. Results obtained for cell media are given in Table 3. 

Factor VII biological activity was assayed by the one-stage clotting assay described by Quick 
(Hemorragic Disease and Thrombisis , 2nd ed., Leat Febiger, Philadelphia, 1966). Results obtained for cell 

20 media are given in Table 3. 

Cells/ml Factor VII Factor VII 

Day (x!0~ 4 ) polypeptide ng/ml activity (ng/ml) 

25 

1 2.9 

2.7 25 6.0 



30 



35 



1.9 

2.8 47 15.9 

1.96 

2.26 160 93 

4.71 

4.14 550 300 



45 5 8.79 

11.28 725 531 



40 



50 



5.1 

8.4 975 600 



55 Example 7: Expression of Factor IX 

Fourteen ug of FIX(-G) — pUC13 were digested with 30 units of Bam HI in 30 ul of high salt buffer for 3 
hours at 37 °C. The DNA was then subjected to electrophoresis in 1% agarose and the 1.4 kb band 
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contining the Factor IX sequence was isolated from the gel. 

Three ug of the vector pD2 were digested with 30 units of Bam HI in 30 ul high salt buffer for 3 hours at 

37 • C. The DNA was subjected to electrophoresis in 1 % agarose and the linear 1 .5 kb fragment isolated. 

The DNA was then treated with 0.12 units calf alkaline phosphatase in 30 ul of 10 mM Tris-HCI pH 8, 1 mM 
5 EDTA for 30 minutes at 37 °C. The salt was adjusted to 0.3 M NaOAc and the sample extracted twice with 

phenol, once with chloroform and the DNA was ethanol precipitated. The pellet was rinsed in 70% ethanol, 

dried and redissolved in 20 ul 10 mM Tris-HCI pH 8, 1 mM EDTA. Equimolar amounts (0.02 pmoles) of the 

two fragments were ligated with 10 units of T* DNA ligase as described above. The reaction mixture was 

used to transform E coli RR1 cells. DNA from twelve of the resulting ampicillin-resistant colonies was 
10 screened by restriction enzyme digestion. One of the clones with the 1 .4 kb fragment inserted in the correct 

orientation was designated as FIX(-G)/pD2. E. coli RR1 transformed with FIX(-G)/pD2 has been deposited 

with ATCC under accession number 53067. 

BHK cells were co-transfected with FIX(-G)/pD2 and pKO-neo as described above. Drug-resistant cells 

were selected and prepared for ELISA and activity assay as described in Example 6. 
15 The assay for biological activity is based on the ability of Factor IX to reduce the clotting time of plasma 

from Factor IX-deficient patients to normal. It was done as described by Procter and Rapaport (Amer. J. 

Clin. Path. 36: 212, 1961). Results are shown in Table 4. 

TABLE 4 

20 

Factor IX Factor IX % active 

Cells/ml polypeptide (ng/ml) activity (ng/ml) protein in 
25 Day (xlO- 4 ) supernatant pellet in supernatant supernatant 



1 1.65 



30 



2 2.66 



57 
45 



20 
20 



27 
24 



50% 



35 



3 9.69 



150 
120 



60 
60 



72 
84 



58% 



40 



14 .79 



475 
225 



160 
140 



198 
150 



50% 



45 



50.85 



875 
1000 



250 
260 



408 
438 



45% 



The amount of Factor IX polypeptide was determined by ELISA essentially as described in Example 6 
so using polyclonal rabbit antisera to Factor IX. Following the incubation of the wells with the Factor IX- 
containing samples, the wells were rinsed and incubated 1 hour at room temperature with 200 ul of affinity 
purified rabbit polyclonal anti-Factor IX conjugated to alkaline phosphatase diluted 1:1000 in PBS containing 
1% BSA and 0.05% Tween 20. The wells were then rinsed four times with PBS containing 0.05% Tween 
20, and enzyme substrate was added as above. Incubations were run at 4* C overnight or 37 'C for 2 hours. 
55 As shown in Table 4, 70-80% of the Factor IX polypeptide is secreted into the media, and about 50% of 
this is biologically active. No Factor IX activity was detected in the cell pellets. 

Highest levels of activity were achieved by supplementing the cell culture medium with vitamin K 
(phytonadione, Merck) at concentrations of 1-10 mg/ml. 



22 



EP 0 200 421 B1 



Several additional analyses were performed to demonstate that the cells were secreting authentic Factor 
IX. Samples containing Factor IX activity according to the above assay were incubated with Factor VIII- 
deficient plasma but did not affect the clotting time, indicating that the activity was due to authentic Factor 
IX rather than a non-specific clotting agent. This conclusion was further verified by depletion of Factor IX 
5 activity from the samples with a specific antibody. Ninety-seven to ninety-eight percent of the Factor IX 
activity was immuno-precipitated from cell supernatants with a rabbit polyclonal antibody against Factor IX. 
This antibody also precipitated over 99% of the Factor IX activity from normal plasma. No Factor IX activity 
was removed from the supernatants by rabbit polyclonal antibody to erythropoietin. 

io Example 8: Construction of an expression vector for Factor VII. 

An expression vector comprising the synthetic Factor VII 5' coding region joined to the partial Factor VII 
cDNA was constructed. The vector, designated pM7135, was generated by inserting the Factor IX leader - 
5' Factor VII sequence from pM7115 and the 3' Factor VII sequence from FIX/VII/pD2 into plasmid pD3, 

75 which comprises the SV40 enhancer and the adenovirus 2 major late promoter and tripartite leader. 

Plasmid pD3 was generated from plasmid pDHFRIII. The Pst I site immediately upstream from the 
DHFR sequence in pDHFRIII was converted to a Bel I site by digesting 10 ug of plasmid with 5 units of Pst 
I for 10' at 37 'C in 100 ul buffer A (10 mM Tris pH 8, 10 mM MgCb, 6 mM NaCI, 7mM 0-MSH). The DNA 
was phenol extracted, EtOH precipitated, and resuspended in 40 ul buffer B (50 mM Tris pH 8, 7 mM 

20 MgCb, 7 mM 0-MSH) containing 10 mM dCTP and 16 units T4 DNA polymerase and incubated at 12 °C for 
60 minutes. Following EtOH precipitation, the DNA was ligated to 2.5 ug kinased Bel I linkers in 14 ul buffer 
C (10 mM Tris pH 8, 10 mM MgCI 2 , 1 mM DTT, 1.4 mM ATP) containing 400 units T4 polynucleotide ligase 
for 12 hours at 12 *C. Following phenol extraction and EtOH precipitation, the DNA was resuspended in 120 
ul buffer D (75 mM KCI, 6 mM Tris pH 7.5, 10 mM MgCI 2 , 1 mM DTT), digested with 80 units Bel I for 60 

25 minutes at 50 *C, then electrophoresed through agarose. Form III plasmid DNA (10 ug) was isolated from 
the gel, and ligated in 10 ul buffer C containing 50 units T4 polynucleotide ligase for 2 hours at 12 *C, and 
used to transform E. coli HB101. Positive colonies were identified by rapid DNA preparation analysis, and 
plasmid DNA (designated pDHFR') prepared from positive colonies was transformed into dAM~ E. coli. 
Plasmid pD2* was then generated by cleaving pDHFR' (15 ug) and pSV40 (25 ug) in 100 — ufbuffer D 

30 with 25 units Bel I for 60 minutes at 50 *C, followed by the addition of 50 units Bam HI and additional 
incubation at 37 'C for 60 minutes. DNA fragments were resolved by agarose gel electrophoresis, and the 
4.9 kb pDHFR' fragment and 0.2 kb SV40 fragment were isolated. These fragments (200 ng pDHFR* DNA 
and 100 ng SV40 DNA) were incubated in 10 ul buffer C containing 100 units T4 polynucleotide ligase for 4 
hours at 12 D C, and the resulting construct (pD2') used to transform E. coli RRI. 

35 Plasmid pD2' was modified by deleting the "poison" sequencesln the pBR 322 region (Lusky and 
Botchan, Nature 293: 79-81, 1981). Plasmids pD2* (6.6 ug) and pML-1 (Lusky and Botchan, ibid) (4 ug) 
were incubated in 50 ul buffer A with 10 units each Eco Rl and Nru I for 2 hours at 37 'C, followed by 
agarose gel electrophoresis. The 1.7 kb pD2' fragment and 1.8 kb pML-1 fragment were isolated and ligated 
together (50 ng each) in 20 ul buffer C containing 100 units T4 polynucleotide ligase for 2 hours at 12 *C, 

40 followed by transformation into E. coli HB101. Colonies containing the desired construct (designated A pD2) 
were identified by rapid preparatioTTanalysis. Ten ug of A pD2 were then digested with 20 units each Eco 
Rl and Bgl II, in 50 ul buffer A for 2 hours at 37 • C. The DNA was electrophoresed through agarose, and the 
desired 2.8 kb fragment (fragment C) comprising the pBR322, 3' splice site and poly A sequences was 
isolated. 

45 To generate the remaining fragments used in constructing pD3, pDHFRIII was modified to convert the 
Sac II (Sst II) site into either a Hind III or Kpn I site. Ten ug pDHFRIII were digested with 20 units Sst II for 2 
hours at 37 • C, followed by phenol extraction and ethanol precipitation. Resuspended DNA was incubated in 
100 ul buffer B containing 10 mM dCTP and 16 units T4 DNA polymerase for 60 minutes at 12 *C, phenol 
extracted, dialyzed, and ethanol precipitated. DNA (5 ug) was ligated with 50 ng kinased Hind III or Kpn I 

50 linkers in 20 ul buffer C containing 400 units T4 DNA ligase for 10 hours at 12 *C, phenol extracted, and 
ethanol precipitated. After resuspension in 50 ul buffer A, the resultant plasmids were digested with 50 units 
Hind III or Kpn I, as appropriate, and electrophoresed through agarose. Gel-isolated DNA (250 ng) was 
ligated in 30 ul buffer C containing 400 units T4 DNA ligase for 4 hours at 12 °C and used to transform E. 
coli RRI. The resultant plasmids were designated pDHFRIII (Hind III) and pDHFRIII (Kpn I). A 700 bp Kpnl- 

55 Bgl II fragment (fragment A) was then purified from pDHFRIII (Kpn I) by digestion with Bgl II and Kpn I 
followed by agarose gel electrophoresis. 

The SV40 enhancer sequence was inserted into pDHFRIII (Hind III) as follows: 50 ug SV40 DNA was 
incubated in 120 ul buffer A with 50 units Hind III for 2 hours at 37 'C, and the Hind III C SV40 fragment 
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(5089-968 bp) was gel purified. Plasmid pDHFRIII (Hind III) (10 ug) was treated with 250 ng calf intestinal 
phosphatase for 1 hour at 37 *C, phenol extracted and ethanol precipitated. The linearized plasmid (50 ng) 
was ligated with 250 ng Hind III C SV40 in 16 ul buffer C for 3 hours at 12 *C, using 200 units T4 
polynucleotide ligase, and transformed into E. coli HB101. A 700 base pair Eco Rl-Kpn I fragment (fragment 

5 B) was then isolated from this plasmid. 

For the final construction of pD3, fragments A and B (50 ng each) were ligated with 10 ng fragment C 
with 200 units T4 polynucleotide ligase for 4 hours at 12 °C, followed by transfection of E. coli RRI. Positive 
colonies were detected by rapid preparation analysis, and a large-scale preparation of pD3 was made. 

Expression vector pM7135 was then constructed. The replicative form of pM7115 was digested with 

w Bam HI and Xba I and the 550 base pair fragment comprising the Factor IX leader and 5' Factor VII 
sequence was gel purified. Plasmid FIX/VII/pD2 was digested with Xba I and Bam HI and the 1700 bp 
fragment comprising the 3* portion of the Factor VII cDNA was gel purified. Plasmid pD3 was digested with 
Bel I, treated with calf alkaline phosphatase, and the three fragments joined in a triple ligation. The resultant 
constructs were screened for the presence of a 2000 base pair Xba I fragment. A plasmid having the 

15 correct orientation was selected and designated pM7135 (Figure 8). 

Example 9: Expression of Factor VII From cDNA Clones. 

In order to express Factor VII cDNA containing a Factor VII leader, DNA from XVII2463 or XVII565 and 

20 XVII2463 was cloned into an expression vector containing the Ad2 major late promoter, SV40 enhancer 
sequences, the Ad2 tripartite leader, a splice set, and the SV40 polyadenylation signal. This vector was 
adapted so that it contains a unique EcoRI sequence as the site of cDNA insertion. The expression of 
sequences from XVII2463, which encodes a 60 amino acid leader, and from XVII565 and XVII2463, which 
lacks the codons for amino acids from -18 to -39 and thus encodes a leader 38 amino acids in length, were 

25 evaluated. Because the structure of the Factor VII leader has only been identified by cDNA cloning, and 
because of the ambiguity generated by having obtained two different 5' -terminal cDNAs, the inventors also 
constructed a genomic-cDNA Factor VII sequence. The 3' portion of XVII2463 (from the Bgl II site in exon 2, 
to the EcoRI site linkered 3' to the poly(A) tail) was adjoined to a subgenomic fragment of clone 7m 1, that 
encoding exons 1a, 1b and the remainder of exon 2. This subgenomic fragment, reconstructed as an EcoRI- 

30 Bglll 4.4 Kb fragment, was adjoined to the XVII2463 cDNA and cloned into a mammalian expression vector. 

Briefly, in order to construct the subclones, the Factor VII cDNA EcoRI fragment of XVII2463 was cloned 
into the EcoRI site of pUC18, and designated pVII2463. Similarly, the EcoRI cDNA insert of XVII565 was 
subcloned into pUC18, and designated pVII565. A hybrid between the 5' portion of the Factor VII sequence 
of clone pVII565 and the 3' segment of Factor VII DNA of pVII2463 was constructed by cloning the 5*-most 

35 EcoRI-Bgl II Factor VII fragment of pVII565 and the Bgl ll-Hind III Factor VII fragment (Hind III site in 
polylinker of pVII2463) of pVII2463 into pUC 18 digested with EcoRI and Hind III. This construct was 
designated pVII2397. The inserts of pVII2463 and pVII2397 were removed by EcoRI digestion and gel 
purified for insertion into mammalian expression vectors as described below. 

40 A. Expression of full-length Factor VII cDNA. 

The expression of Factor VII was achieved in the vector pDX. This vector was derived from pD3 
(described in Example 8 above) and pD3', a vector identical to pD3 except that the SV40 polyadenylation 
signal (i.e. the SV40 Bam HI [2533 bp] to Bell [2770bp] fragment) is in the late orientation. Thus, pD3* 

45 contains a Bam HI site as the site of gene insertion. 

To generate pDX, the EcoRI site in pD3' was converted to a Bel I site by Eco Rl cleavage, incubation 
with S1 nuclease, and subsequent ligation with Bel I linkers. DNA was prepared from a positively identified 
colony, and the 1.9 kb Xhol-Pstl fragment containing the altered restriction site was prepared via agarose 
gel electrophoresis. In a second modification, Bel l-cleaved pD3 was ligated with kinased Eco Rl-Bcl I 

50 adaptors (constructed from oligonucleotides ZC 525, 5 GGAATTCT 3 '; and ZC526, 5 G ATC AG AATTCC 3 ') in 
order to generate an Eco Rl site as the position for inserting a gene into the expression vector. Positive 
colonies were identified by restriction endonuclease analysis, and DNA from this was used to isolate a 2.3 
kb Xhol-Pstl fragment containing the modifed restriction site. The two above-described DNA fragments were 
incubated together with T4 DNA ligase, transformed into E. coli HB101 and positive colonies were identified 

55 by restriction analysis. A preparation of such DNA, termed pDX, was made (Figure 12). This DNA was 
cleaved with Eco Rl and subsequently incubated with calf-intestinal phosphatase. The purified DNA was 
then incubated with T4 DNA ligase and the Factor VII Eco Rl fragment from pVII2463, or with the Factor VII 
Eco Rl cDNA fragment derived from pVII2397. The resultant clones were designated FVII(2463)/pDX and 
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FVII(565 + 2463)/pDX, respectively (Figure 12). After transformation into E. coli JM83 and subsequent 
identification by restriction enzyme analysis, plasmid DNA preparations ~we7e~ made and checked by 
extensive restriction endonuclease digestion. The plasmids FVII(2463)/pDX and FVII(565 + 2463)/pDX have 
been deposited with American Type Culture Collection and have been assigned accession numbers 40206; 

5 and 40205, respectively. 

FVII(2463)/pDX and FVII(565 + 2463)/pDX (10 ug each) were each transfected, along with 10 ug salmon 
sperm carrier DNA, into either BHK tk"ts13 cells (Floros et al. f Exper. Cell Res. 132: 215-223, 1981) or COS 
cells, using standard calcium-phosphate precipitation. Following transfection, the~cells were cultured in the 
appropriate media containing 5 ug/ml vitamin K for two days. At this time, the supernatants were assayed 

70 for ELISA-positive material, using a monoclonal antibody directed against Factor VII. Both FVII(2463)/pDX 
and FVII(565 + 2463)/pDX directed the production of Factor VII polypeptide which was detected in COS cell 
supernatants, and Factor VII from FVII(565 + 2463)pDX was detected in BHK cell supernatant. Sham- 
transfected BHK cells or COS cells did not yield detectable levels of Factor VII (Table 5). 

75 

TABLE 5 

ELISA Positive 
Cell Cell Material (ng/ml 

20 DNA Line Number Culture Medium) 

FVII(2463)/pDX COS 2 x 10 6 15 

FVIK 565+2463 >/pDX COS 2 x 10 6 12 

25 Control COS 2 x 10 6 <2 

FVII(2463)/pDX BHK 9 x 10 6 62 

FVIK 565+2463 )/pDX BHK 9 x 10 6 6 

30 Control BHK 9 x 10 6 <2 
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Transient expression of Factor VII and also tested in several other cell lines, listed in Table 6. 



Name 

1. Rat Hep I 

2. Rat Hep II 

3. TCMK 

4. Human lung 



5 . Human 
hepatoma 



6. Hep 62 



TABLE 6 
Description 

Rat hepatoma H4-II-E-C3 

Rat hepatoma H4-II-E 

Mouse Kidney, SV40 virus 
transformed, TCMK-1 

SV40 virus transformed 
WI-38 VA13, subline 2RA 

Human liver adenocarcinoma 
SK-HEP-1 

Human hepatoma, dev. by 
Barbara Knowles/Wistar 
Institute 



7. Mouse liver NCTC clone 1469 



Reference 
(ATTC ») 

CRL 1600 

CRL 1548 

CCL 139 

CCL 75.1 

HTB-52 

HTB 8065 

CC 29.1 



8. COS 

9. BHK 

10. 293 

11. DUKX 



SV40-transf ormed CV-1 
(monkey) cells 

Baby hamster kidney 
BHK-21 (C-13) 

Human embryonic 
Kidney/ Ad transformed 

CHO-DHFR sens 



CRL 1650 



CCL 10 



CRL 1573 

(Urlaub & 
Chasin, 
PKAS (USA) 
7_7: 4216- 
4220, 1980) 



Cells were cotransfected with 10 ug of either FVII(2463)/pDX or FVII(565 + 2463)/pDX together with 1 
ug of a plasmid comprising the chloramphenicol acetyl transacetylase gene (to permit identification of 
cotransfected cells) and 10 ug of salmon sperm DNA. Mock transfected cells were used as controls. Spent 
media were assayed by ELISA after six days. Results are given in Table 7. 
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TABLE 7 

Cell 



Sample 


Cell Line 


Pi asm id 


Number 

v 1 A""6 
X J.U 


ELISA 
l iiy/niX 


1. 


Rat Hep I 


Mock 


11 .D 


2.4 


2. 


Rat Hep I 


FVII (565+2463 )/pDX 


7.0 


2 


3. 


Rat Hep I 


FVII (2463 )/pDX 


7.0 


<2 


4. 


Rat Hep 2 


Mock 


13 .0 


<2 


5. 


Rat Hep 2 


FVII (565+2463 )/pDX 


18 .4 


<2 


6. 


Rat Hep 2 


FVII(2463)/pDX 


14.6 


<2 


7. 


TCMK 


Mock 


18.8 


<2 


8. 


TCMK 


FVII(565+2463)/pDX 


9.8 


<2 


9. 


TCMK 


FVII ( 2463 )/pDX 


12.8 


<2 


10. 


Human Lung 


Mock 


7.2 


<2 


11. 


Human Lung 


FVII(565+2463)/pDX 


3.4 


16.5 
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29. 


293 


FVII(565+2463)/pDX 


6.2 


94 


30. 


293 


FVII(2463)/pDX 


8.2 


100 


31. 


DUKX 


Mock 


11.6 


<2 


32. 


DUKX 


FVIK 565+2463 )/pDX 


13.0 


<2 


33. 


DUKX 


FVII(2463)/pDX 


13.6 


<2 



FVII(2463)/pDX (10 ug) or FVII(565 + 2463)/pDX (10 ug) was co-transfected with 10 ug of salmon sperm 
DNA and 1 ug of a plasmid encoding the resistant form of dihydrofolate reductase (Simonsen and Levinson, 
Proc. Natl. Acad. Sci. USA 80: 2495-2499, 1983) in a mammalian expression vector, into BHK tk"ts13 cells. 
After two days, the cells weTe split 1:14 and placed into selective media containing either 250 nM or 1000 
nM methotrexate (MTX) and 5 ug/ml vitamin K (phytadione, Merck). After two weeks, colonies were isolated 
and grown to 50-90% confluency. The supernatant media were then assayed for Factor Vll polypeptide by 
ELISA. Of the 25 positive clones, 22 were analyzed further. The cells were plated at 5 x 10 4 (Group I) or 1 x 
10 s (Group II) in 10 cm dishes containing 5 ug/ml vitamin K, and either 250 nM or 1000 nM methotrexate. 
Five days later, the faster growing clones (designated by an asterisk in Table 8) were split 1 :2, then after 24 
hours, the media were changed on all plates. Twenty-three hours (Group I) or 20 hours (Group II) later, 
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supernatant media were harvested and cell counts were taken for each clone. The media were assayed 
both by ELISA and by the one-stage clotting assay. Results are shown in Table 8. 

TABLE 8 
GROUP I - 23-hour assay 

ELISA Clot- 
Cell (pg/ ELISA ting 



Clone 


Count 

Plasmid X10 -5 


ceii/ 
day) 


/ nrr / 
i ng/ 

ml) 


ml) 


B 

V 

Active 


B4-A1* 


FVIK 565+2463 )/pDX 


2 


6.5 


130 


206 


158 


B4-B1* 


FVIK 565+2463 )/pDX 


27 


1.9 


513 


360 


70 


B4-C1 


FVII(565+2463)/pDX 


16 


2.5 


393 


480 


122 


B4-C2* 


FVII(565+2463)/pDX 


9 


<0.2 


<20 


21 




B4-C3 


FVIK 565+2463 )/pDX 


52 


1.5 


800 


910 


114 


B4-D1* 


FVIK 565+2463 )/pDX 


27 


2.0 


553 


570 


103 


B4-D2* 


FVIK 565+2463 )/pDX 


13 


1.2 


150 


154 


103 


B4-E1 


FVIK 565+2463 )/pDX 


39 


2.2 


870 


1160 


133 


B4-E2* 


FVIK 565+2463 )/pDX 


8 


2.5 


205 


240 


117 


B4-E3 


FVII(565+2463)/pDX 


23 


1.2 


275 


320 


116 


B4-E4 


FVIK 565+2463 )/pDX 


31 


1.3 


410 


300 


73 


B3-5.3* 


FVIK 2463 )/pDX 


5 


8.2 


410 


290 


70 



GROUP II - 20-hour assay 



Clone 


Plasmid 


Cell 
Count 
X10" 5 


ELISA 

(pg/ 

cell/ 
day) 


ELISA 
(ng/ 
ml) 


Clot- 
ting 
(ng/ 
ml) 


% 

Active 


B3-2.2 


FVIK2463)/pDX 


41 


2.5 


1043 


500 


48 


B3-2.3 


FVIK2463)/pDX 


19 


3.0 


580 


610 


105 


B3-3.2 


FVIK2463)/pDX 


13 


1.5 


197 


216 


110 


B3-4.2 


FVIK 2463 )/pDX 


41 


1.8 


760 


620 


82 


B3-5.1 


FVIK 2463 )/pDX 


14 


3.3 


460 


400 


87 


B3-5.2 


FVIK 2463 )/pDX 


9 


2.7 


257 


184 


72 


B6-D 


FVIK2463)/pDX 


54 


3.0 


1700 


780 


46 


B6-E 


FVIK2463)/pDX 


101 


1.6 


1640 


970 


59 


B6-G 


FVII(2463)/pDX 


31 


5.7 


1853 


1080 


58 


B6-M 


FVIK 2463 )/pDX 


75 


2.2 


1743 


940 


54 
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B. Expression of Factor VII genomic-cDNA hybrid. 

An expression vector containing genomic sequences representing the Factor VII genomic 5'-terminus 
and cDNA sequences from the Factor VII gene 3'-terminus was prepared as follows. Three subclones of the 

5 genomic plasmid 7 ml were used to reconstruct the 5'-terminus: 7Bam, 7SD and 7SE. Plasmid 7Bam is a 
3.6 Kb EcoRI-Bam HI fragment containing exon 1a, subcloned into pUC"l2. A 0.7 Kb EcoRI-Xbal fragment, 
which contains exon 1a, was isolated from this subclone and is designated fragment a. Plasmid 7SD is a 3.7 
Kb Sstl fragment containing exon 1b, subcloned into pUC18. An exon 1 b-containing 3.1 Kb Xbal-Sstl 
fragment was isolated from this subclone and is designated fragment b. Plasmid 7SE is a 3.9 Kb Sstl 

w fragment containing exons 2-4, subcloned into M13mp 19. An Sstl-Bgl I! (0.6 Kb) fragment containing the 5' 
part of exon 2 was gel isolated and is designated fragment c. The remainder of the 3*-Factor VII cDNA 
(fragment d) was obtained as a 2 Kb Bglll-EcoRI fragment from pUCVII2463. Fragments a-d were ligated 
with EcoRI-cleaved and calf intestinal-phosphatased pDX, and then transformed into E. coli JM83 or HB101. 
Positive colonies were identified by restriction endonuclease analysis, and plasmid DNAwas prepared from 

75 these colonies. 

For expression of Factor VII, the plasmid DNA is co-transfected into BHK or COS cells as described 
above. Transfected cells are cultured in vitamin K-containing medium for 2 days, and the medium is 
assayed for Factor VII by ELISA. 

From the foregoing it will be appreciated that, although specific embodiments of the invention have 
20 been described herein for purposes of illustration, various modifications may be made. Accordingly, the 
invention is not to be limited except by the appended claims. 

The various strains of E. Coli used in the foregoing Examples were the personal choice of the inventors. 
Persons skilled in this arTwliTappreciate that other suitable strains of E. Coli could be substituted (for 
example if the person already has in his possession samples of other suitable E. Coli strains). 
25 Transformants ATCC 53067 and 53068 were deposited with the American Type Culture Collection on 
28 March 1985 and plasmids ATCC 40205 and 40206 were deposited with the American Type Culture 
Collection on 25 November 1985, all four deposits being made in accordance with the Budapest Treaty. 

The features disclosed in the foregoing description, in the following claims and/or in the accompanying 
drawings may, both separately and in any combination thereof, be material for realising the invention in 
30 diverse forms thereof. 

Claims 

1. A DNA construct containing a nucleotide sequence encoding human factor VII having an amino acid 
35 sequence as shown in Figure 1 b. 

2. The DNA construct of Claim 1 wherein said nucleotide sequence also encodes a leader peptide. 

3. The DNA construct of Claim 1 wherein said nucleotide sequence includes a synthesized double- 
40 stranded oligonucleotide. 

4. The DNA construct of Claim 3 wherein said synthesized double-stranded oligonucleotide codes for the 
amino-terminal portion of Factor VII. 

45 5. The DNA construct of Claim 1 wherein at least a portion of said nucleotide sequence is derived from a 
cDNA clone or a genomic clone of Factor VII. 

6. The DNA construct of Claim 1 comprising a first nucleotide sequence derived from a cDNA or a 
genomic clone of Factor VII, joined to a second nucleotide sequence positioned downstream of said 

50 first sequence, said second sequence derived from a cDNA clone of Factor VII, the joined sequences 
coding for a protein which upon activation has Factor Vila biological activity for blood coagulation. 

7. The DNA construct of Claim 1 wherein said nucleotide sequence comprises the cDNA sequence of 
Figure 1b, from bp 36 to bp 1433. 

55 

8. The DNA construct of Claim 1 wherein said nucleotide sequence comprises the cDNA sequence of 
Figure 1b, from bp 36 to bp 99, followed downstream by the sequence from bp 166 to bp 1433. 
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9. The DNA construct of Claim 1 wherein said nucleotide sequence codes for the amino acid sequence of 
Figure 1b from alanine, amino acid number 1, to proline, amino acid number 406. 

10. A recombinant plasmid capable of integration in mammalian host cell DNA, said plasmid including a 
5 promoter followed downstream by a nucleotide sequence according to any of the previous Claims 1 to 

9, said nucleotide sequence being followed downstream by a polyadenylation signal. 

11. Mammalian cells transfected with a recombinant plasmid according to Claim 10. 

w 12. A method for producing a protein having biological activity for blood coagulation mediated by Factor 
Vila, comprising: 

establishing a mammalian host cell which contains a DNA construct containing a nucleotide 
sequence encoding human factor VII; 

growing said mammalian host cell in an appropriate medium which contains vitamin K; 
15 isolating the protein product encoded by said DNA construct produced by said mammalian host 

cell; and 

activating said protein product to generate a protein which has Factor Vila biological activity for 
blood coagulation. 

20 13. The method of Claim 12, including amplification of the DNA construct by cotransfection of the host cell 
with a gene encoding dihydrofolate reductase, wherein the appropriate medium comprises methotrex- 
ate. 

14. The method of Claim 12 wherein said protein product is activated by reacting the protein with a 
25 proteolytic enzyme selected from the group consisting of Factor XI la, Factor IXa, kallikrein, Factor Xa, 

and thrombin. 

15. A pharmaceutical preparation for the treatment of bleeding disorders containing a protein having an 
amino acid sequence as shown in Figure 1b and free of contaminating human proteins. 

30 

16. A method of producing a protein having biological activity for blood coagulation mediated by Factor 
Vila, which process comprises growing in an appropriate medium which contains vitamin K an 
established mammalian host cell which contains a DNA construct containing a nucleotide sequence 
encoding human factor VII, isolating the protein product encoded by said DNA construct produced by 

35 said mammalian host cell and activating said protein product to generate a protein which has Factor 
Vila biological activity for blood coagulation. 

17. The method of Claim 12, 13, 14 or 16 wherein said host cell is a non-hepatic cell. 

40 18. A method of preparing a pharmaceutical composition for the treatment of bleeding disorders, which 
method comprises preparing a pharmaceutical composition containing a protein produced by the 
method of any one of Claims 12, 13, 14, 16 and 17. 

PatentansprUche 

45 

1. DNA-Aufbau, der eine Nukleotidsequenz enthalt, welche menschlichen Faktor VII kodiert, mit einer 
Aminosauresequenz wie in Figur 1b gezeigt. 

2. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz auch ein Leader-Peptit kodiert: 

50 

3. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz ein synthetisiertes doppelstrangiges 
Oligonukleotid umfaBt. 

4. DNA-Aufbau nach Anspruch 3, bei dem das synthetisierte doppelstrangige Oligonukleotid flir den 
55 Aminoende-Abschnitt des Faktors VII kodiert. 

5. DNA-Aufbau nach Anspruch 1, bei dem wenigstens ein Abschnitt der Nukleotidsequenz von einem 
cDNA-Klon oder einem genomischen Klon des Faktors VII abgeleitet ist. 
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6. DNA-Aufbau nach Anspruch 1, welcher eine erste Nukleotidsequenz aufweist, die von einem cDNA- 
oder einem genomischen Klon des Faktors VII abgeleitet ist, angebunden an eine zweite Nukleotidse- 
quenz, die stromabwarts der ersten Sequenz angeordnet ist, wobei die zweite Sequenz von einem 
cDNA-Klon des Faktors VII abgeleitet ist, wobei die verbundenen Sequenzen fur ein Protein kodieren, 

5 das nach der Aktivierung die biologische Aktivitat des Faktors Vila fur Blutgerinnung hat. 

7. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz eine cDNA-Sequenz der Figur 1b umfaBt, 
von Basenpaar 36 bis Basenpaar 1433. 

w 8. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz die cDNA-Sequenz der Figur 1b 
aufweist, von Basenpaar 36 bis Basenpaar 99, stromaufwarts gefolgt von der Sequenz von Basenpaar 
166 bis Basenpaar 1433. 

9. DNA-Aufbau nach Anspruch 1, bei dem die Nukleotidsequenz fur die Aminosauresequenz der Figur 1b 
75 von Alanin, Aminosaure Nr. 1, bis Prolin, Aminosaure Nr. 406 kodiert. 

10. Rekombinantes Plasmid, geeignet zur Integration in Saugetier-Wirtszellen-DNA, wobei das Plasmid 
einen Promoter umfaBt, der stromabwarts von einer Nukleotidsequenz gemaS einem der vorangehen- 
den Anspruche 1 bis 9 gefolgt ist, wobei die Nukleotidsequenz stromabwarts von einem Polyadenilie- 

20 rungssignal gefolgt ist. 

11. Saugetierzellen, transfiziert mit einem rekombinanten Plasmid nach Anspruch 10. 

12. Verfahren zum Produzieren eines Proteins mit biologischer Aktivitat fur Blutgerinnung, vermittelt durch 
25 Faktor Vila, umfassend: 

Einrichten einer Saugetier-Wirtszelle, welche einen DNA-Aufbau enthalt, der eine menschiischen Faktor 
VII kodierende Nukleotidsequenz enthalt; 

Zuchten der Saugetier-Wirtszelle in einem geeigneten Medium, welches Vitamin K enthalt; 
Isolieren des Protein prod uktes, kodiert von dem DNA-Aufbau, das von der Saugetier-Wirtszelle produ- 
30 ziert worden ist; und 

Aktivieren des Protein produktes, urn ein Protein zu erzeugen, das eine biologische Aktivitat fur 
Blutgerinnung von Faktor Vila hat. 

13. Verfahren nach Anspruch 12, einschliefllich der Verstarkung des DNA-Aufbaus durch Kotransfektion der 
35 Wirtszelle mit einem Gen, das die Hydrofolat-Reduktase kodiert, wobei das geeignete Medium 

Methotrexat aufweist. 

14. Verfahren nach Anspruch 12, bei dem das Protein prod ukt aktiviert wird, indem das Protein mit einem 
proteolytischen Enzym reagiert wird, das aus der Gruppe bestehend aus Faktor Xlla, Faktor IXa, 

40 Kalikrein, Faktor Xa und Thrombin ausgewahlt ist. 

15. Pharmazeutische Preparation fUr die Behandlung von Blutungserkrankungen, welche ein Protein mit 
einer Aminosauresequenz wie in Figur 1b gezeigt und frei von kontaminierenden menschlichen 
Proteinen enthalt. 

45 

16. Verfahren zum Produzieren eines Proteins mit biologischer Aktivitat fUr Blutgerinnung, vermittelt durch 
Faktor Vila, bei dem der ProzeB das Zuchten einer eingerichteten Saugetier-Wirtszelle, welche einen 
DNA-Aufbau, der eine menschlichen Faktor VII kodierende Nukleotidsequenz enthalt, enthalt, in einem 
geeigneten Medium, das Vitamin K enthalt, das Isolieren das von diesem DNA-Aufbau kodierten 

50 Proteinproduktes, das von der Saugetier-Wirtszelle produziert worden ist, und das Aktivieren des 
Proteinproduktes, urn ein Protein zu erzeugen, das die biologische Aktivitat fur Blutgerinnung von 
Faktor Vila hat, aufweist. 

17. Verfahren nach Anspruch 12, 13, 14 oder 16, bei dem die Zelle eine nicht-hepatische Zelle ist. 

55 

18. Verfahren zum Praparieren einer pharmazeutischen Zusammensetzung fur die Behandlung von Blu- 
tungserkrankungen, wobei das Verfahren das Praparieren einer pharmazeutischen zusammensetzung 
umfaBt, die ein durch das Verfahren nach einem der AnsprOche 12, 13, 14, 16 und 17 produziertes 
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Protein enthait. 
Revendicatlons 

5 1. Produit de recombinaison d'ADN contenant une sequence de nucleotides codant pour le facteur 
humain VII, ayant une sequence d'amino-acides telle que representee sur la figure 1b. 

2. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
code aussi pour un peptide leader. 

w 

3. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
renferme un oligonucleotide double brin synthetise. 

4. Produit de recombinaison d'ADN suivant la revendication 3, dans lequel ('oligonucleotide double brin 
75 synthetise code pour la portion terminate amino du facteur VII. 

5. Produit de recombinaison d'ADN suivant la revendication 1, dans iequel au moins une portion de la 
sequence de nucleotides est derivee d'un clone d'ADNc ou d*un clone genomique du facteur VII. 

20 6. Produit de recombinaison d'ADN suivant la revendication 1, comprenant une premiere sequence de 
nucleotides derivee d'un clone d'ADNc ou d'un clone genomique du facteur VII, nee a une seconde 
sequence de nucleotides disposee en aval de la premiere sequence, la seconde sequence etant 
derivee d'un clone d'ADNc du facteur VII, les sequences liees codant pour une proteine qui, par 
activation, possede I'actitive biologique du facteur Vila pour la coagulation du sang. 

25 

7. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
comprend la sequence d'ADNc de la figure 1b, de la paire de bases 36 a la paire de bases 1433. 

8. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
30 comprend la sequence d'ADNc de la figure 1b, de la paire de bases 36 a la paire de bases 99, suivie 

en aval de la sequence allant de la paire de bases 166 a la paire de bases 1433. 

9. Produit de recombinaison d'ADN suivant la revendication 1, dans lequel la sequence de nucleotides 
code pour la sequence d'amino-acides de la figure 1b allant de I'amino-acide, alanine, numero 1 a 

35 I'amino-acide, proline, numero 406. 

10. Plasmide recombinant susceptible d'integration dans I'ADN d'une cellule-hote de mammif&re, ledit 
plasmide comprenant un promoteur suivi en aval d'une sequence de nucleotides suivant Tune 
quelconque des revendications 1 a 9, ladite sequence de nucleotides etant suivie en aval d'un signal 

40 de polyadenylation. 

11. Cellules de mammifere transferees avec un plasmide recombinant suivant la revendication 10. 

12. Procede de production d'une proteine douee d'activite biologique pour la coagulation du sang sous la 
45 mediation du facteur Vila, comprenant : 

retablissement d'une cellule-hote de mammifere qui contient un produit de recombinaison d'ADN 
comportant une sequence de nucleotides codant pour le facteur humain VII ; 

la croissance de ladite cellule-hote de mammif&re dans un milieu approprie qui contient de la 
vitamine K ; 

50 I'isolement du produit proteique code par le produit de recombinaison d'ADN eiabore par la cellule- 

hote de mammifere ; et 

I'activation dudit produit proteique pour engendrer une proteine qui a I'activite biologique du facteur 
Vila pour la coagulation du sang. 

55 13. Procede suivant la revendication 12, comportant ('amplification du produit de recombinaison d'ADN par 
co-transfection de la cellule-hote avec un g£ne codant pour la dihydrofolate-reductase, ou le milieu 
approprie comprend du methotrexate. 
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14. Procede suivant la revendication 12, dans lequel le produit proteique est active par reaction de la 
proteine avec un enzyme proteolytique choisi dans le groupe comprenant le facteur Xlla, le facteur IXa, 
la kallicreine, le facteur Xa et la thrombine. 

s 15. Preparation pharmaceutique destinee au traitement de troubles h£morragiques, contenant une proteine 
ayant une sequence d'amino-acides telle que representee sur la figure 1 b et ddpourvue de prolines 
humaines contaminantes. 

16. Procede de production d'une proteine douee d'activite biologique pour la coagulation du sang sous la 
w mediation du facteur Vila, procede qui comprend la croissance, dans un milieu approprie contenant de 

la vitamine K, d'une cellule-hote de mammifere 6tablie qui contient un produit de recombinaison d'ADN 
contenant une sequence de nucleotides codant pour le facteur humain VII, I'isolement du produit 
proteique code par le produit de recombinaison d'ADN eiabore" par ladite cellule-hote de mammifere et 
Pactivation du produit proteique pour engendrer une proteine qui a I'activite biologique du facteur Vila 
75 pour la coagulation du sang. 

17. Procede* suivant la revendication 12, 13, 14 ou 16, dans lequel la cellule-hote est une cellule non 
hepatique. 

20 1a Procede de preparation d'une composition destinee au traitement de troubles hemorragiques, procede 
qui comprend la preparation d'une composition pharmaceutique contenant une proline produite par le 
proc£de* suivant Tune quelconque des revendications 12, 13, 14, 16 et 17. 
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FIG. HA 
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71* 729 7** 

CGG ACC TTC TCT GAG AGG ACG CTG 6CC TTC GTG CCC TTC TCA TTG CTC AGC GGC 
Arg Thr Phe $er Glu Arg Thr Leu All Phe Val Arg Phe Ser Leu Val Ser Cly 

759 77* Nar I 789 80* 

TQG GGC CAG CTG CTG GAC CGT GGC GCC ACG GCC CTG GAG CTC ATG GTC CTC AAC 

Trp Cly Gin Leu Leu Asp Arg Gly Ala Thr AU Leu Glu Leu Het Vet .Leu Asn 

819 83* Pst lb 8*9 86* 

GTG CCC CGG CTG ATG ACC CAC GAC TCC CTG CAG CAG TCA CGG AAG GTG GGA GAC 
Val Pro Arg Leu Net Thr Gin Asp Cys Leu Gin Gin Ser Arg Lys Val Gly Asp 

879 89* 909 

TCC CCA AAT ATC ACG GAG TAC ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG 
Ser Pro Asn Me Thr Glu Tyr Met Phe Cys Ala Gly Tyr Ser Asp Gly Ser Lys 

92* 939 95* 969 

GAC TCC TGC AAG GGG GAC AGT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG 
Asp Ser Cys Lys Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr 

98* 999 101* 

TGG TAC CTG ACG GGC ATC GTC AGC TCG GGC CAG GGC TGC GCA ACC GTG GGC CAC 
Trp Tyr Leu Thr Gly lie Val Ser Trp Cly Gin Gly Cys Ala Thr Val Gly His 

1029 1 0** 1059 TaqI 107* 

TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG CTG CAA AAG CTC ATG 
Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lTe~G1u Trp Leu Gin Lys Leu Met 

1089 1 10* 1119 "38 

CGC TCA GAG CCA CGC CCA GCA GTC CTC CTG CGA CCC CCA TTT CCC TAG CCCAGCAGCC 
Arg Ser Glu Pro Arg Pro Gly Val Leu Leu Arg Ala Pro Phe Pro 

Pstlc 

11*8 1158 1168 1178 1188 1198 1208 

CTGGCCTGTG GAGAGAAAGC CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATC CC AT ATA TTCTTCTGCA 



1218 1228 1238 12*8 1258 1268 1278 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGCGAG ACAGAGACAG AAACAGAGAG 



1288 1298 1308 1318 1328 1338 13*8 

AGACAGAGAC AGAGAGAGAC TGAGGGAGAG ACTCTGAGGA CCATGGACAG AGACTCAAAG AGACTCCAAG 



1358 1368 1378 1388 1398 1*08 1*18 

ATTCAAAGAG ACTAATAGAG ACACAGAGAT GGAATACAAA AG AT GAG AGG CA6AGGCAGA CAGGCGCTCG 



1*28 1*38 1**8 1*58 1*68 1*78 1*88 

ACA6AGGGGC AGGG6AGTGC CAAGGTTGTC CTGGAGGCAG ACAGCCCACC T6AGCCTCCT TACCTCCCTT 



36 



EP 0 200 421 B1 



1498 1508 1518 ^ 1528 ~ 1538 15*8 1558 

CAGCCAAGCC CCACCTGCAC CTGATCTCCT GGCCCTCAGG CTGCTCCTCT GCCTTCATTC CTGGACACAC 



1568 1578 1588 1598 1608 1618 1628 

TAGAGCCATG ACACACATCG ATGCACACAC ACACACGCCA TGCACACACA CAGAGATATC CACACACACC 



1638 16*8 1658 1668 1678 1688 I698 

GATGCACACA CAGATGGTCA CACAGAGTAC GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC 



1708 1718 1728 1738 17*8 1758 1768 

ACAGATGCAC ACACAGATAT AC AC AT G GAG TGCACGCACA TCCCAATGCA CGCACACATC AGTGCACACG 



1778 1788 1798 1808 1818 1828 1838 

GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA GATATCCACA CACATCGATG AGCACACACA 



1848 1858 1868 1878 1888 I898 1908 

CACCAAGTGC GCACACACAC CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 



1918 1928 1938 19*8 1958 1968 1978 

CATGTGTCCT GTCCTCTGAA GGCCGTTGTT TAGCTCTCAC TTTTCTGGTT CTTATCCATT ATCATCTTCA 



1988 1998 2008 2018 2028 2038 2048 

CTTCAGACAA TTCAGAAGCA TCACCATCCA TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATCTATTTC 



2058 2068 2078 2088 2098 2108 2118 

TCCCTTCGCT GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT AAACGCCTGC 



2128 2138 2148 2158 2168 EcoRIb 

GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACGAATTC 
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FIGolB -60 

MetValSerGlnAlaLeuArgLeuLeu 
TCAACAGGCAGGGGCAGCACTGCAGAGATTTCATCATGGTCTCCCAGGCCCTCAGGCTCCTC 
10 20 30 40 50 60 

-50 -40 i 

CysLeuLeuLeuGlyLeuGlnGlyCysLeuAlaAlaGlyGlyValAlaLysAlaSerGlyGly 
TGCCTTCTGCTTGGGCTTCAGGGCrGCCTGGCrGCAGGCGGGGTCGCTAAGGCCPCAGGAGGA 
70 80 90 100 110 120 

-30 -20 J 

GluThr Arg AspMet ProTrpLy s ProGly ProH i s Arg Val PheValThrGlnGluGlu 
GAAACACGGGACATGCCGTGGAAGCCGGGGCCTCACAGAGTCTTCGTAACCCAGGAGGAA 
130 140 150 160 170 180 

-10 -1 +1 +10 

AlaHisGlyValLeuHisArgArgArgArgAlaAsnAlaPheLeuGluGluLeuArgPro 
GCCCACGGCGTCCTGCACCGGCGCCGGCGCGCCAACGCGTTCCTGGAGGAGCTGCGGCCG 
190 200 210 220 230 240 

+20 +30 
GlySerLeuGluArgGluCysLysGluGluGlnCysSerPheGluGluAlaArgGluTle 
GGCTCCCTGGAGAGGGAGTGCAAGGAGGAGCAGTGCTCCTTCGAGGAGGCCCX5GGAGATC 
250 260 270 280 290 300 

+40 +50 
PheLysAspAlaGluArgThrLysLeuPheTrpIleSerTyrSerAspGlyAspGlnCys 
TTCAAGGACGCGGAGAGGACGAAGCTGTTCTGGATTTCTTACAGTGATGGGGACCAGTGT 
310 320 330 340 350 360 

+60 +70 
AlaSerSerProCysGlnAsnGlyGlySerCysLysAspGlnLeuGlnSerTyrlleCys 
GCCTCAAGTCCATGCCAGAATGGGGGCTCCTGCAAGGACCAGCTCCAGTCCTATATCTGC 
370 380 390 400 410 420 

+80 +90 
PheCysLeuProAlaPheGluGlyArgAsnCysGluThrHisLysAspAspGlnLeuIle 
TTCTGCCTCCCTGCCTTCGAGGGCCGGAACTGTGAGACGCACAAGGATGACCAGCTGATC 
430 440 450 460 470 480 

+100 +110 
CysValAsnGluAsnGlyGlyCysGluGlnTyrCysSerAspHisThrGlyThrLysArg 
T GTGTG AACG AG A ACGGCGG CTGTG AGC AGT ACTGC AGTG ACC AC ACGGGC ACC AAGCGC 
490 500 510 520 530 540 

+120 +130 
SerCysArgCysHisGluGlyTyrSerLeuLeuAlaAspGlyValSerCysThrProThr 
TCCTGTCGGTGCCACGAGGGGTACTCTCTGCTGGCAGACGGGGTGTCCTGCACACCCACA 
550 560 570 580 590 600 

+140 +150 
ValGluTyrProCysGlyLysIleProIleLeuGluLysArgAsnAlaSerLysProGln 
GTTGAATATCCATGTGGAAAAATACCTATTCTAGAAAAAAGAAATGCCAGCAAACCCCAA 
610 620 630 640 650 660 
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+160 +170 
GlyArglleValGlyGlyLysValCysProLysGlyGluCysProTrpGlnValLeuLeu 
GGCCGAATTGTGGGGGGCAAGGTGTGCCCCAAAGGGGAGTGTCCATGGCAGGTCCTGTTG 
670 680 690 700 710 720 

+180 +190 
LeuValAsnGlyAlaGlnLeuCysGlyGlyThrLeuIleAsnThrlleTrpValValSer 
TTGGTGAATGGAGCTCAGTTGTGTGGGGGGACCCTGATCAACACCATCTGGGTGGTCTCC 
730 740 750 760 770 780 

+200 +210 
AlaAlaHisCysPheAspLysIleLysAsnTrpArgAsnLeuIleAlaValLeuGlyGlu 
GCGGCCCACTGTTTCGACAAAATCAAGAACTGGAGGAACCTGATCGCGGTGCTGGGCGAG 
790 880 810 820 830 840 

+220 +230 
HisAspLeuSerGluHisAspGlyAspGluGlnSerArgArgValAlaGlnValllelle 
CACGACCTCAGCGAGCACGACGGGGATGAGCAGAGCCGGCGGGTGGCGCAGGTCATCATC 
850 860 870 880 890 900 

+240 +250 
ProSerThrTyrValProGlyThrThrAsnHisAspIleAlaLeuLeuArgLeuHisGln 
CCCAGCACGT ACGTCCCGGGCACCACCAACCACGACATCGCGCTGCTCCGCCTGCACCAG 
910 920 930 940 950 960 

+260 +270 
ProValValLeuThrAspHisValValProLeuCysLeuProGluArgThrPheSerGlu 
CCCGTGGTCCTCACTGACCATGTGGTGCCCCTCTGCCTGCCCGAACGGACGTTCTCTGAG 
970 980 990 1000 1010 1020 

+280 +290 
ArgThrLeuAlaPheValArgPheSerLeuValSerGlyTrpGlyGlnLeuLeuAspArg 
AGGACGCTGGCCTTCGTGCGCTTCTCATTGGTCAGCGGCTGGGGCCAGCTGCTGGACCGT 
1030 1040 1050 1060 1070 1080 

+300 +310 
GlyAlaThrAlaLeuGluLeuMetValLeuAsnValProArgLeuMetThrGlnAspCys 
GGCGCCACGGCCCTGGAGCTCATGGTCCTCAACGTGCCCCGGCTGATGACCCAGGACTGC 
1090 1100 1110 1120 1130 1140 

+320 +330 
LeuGlnGlnSerArgLysValGlyAspSerProAsnlleThrGluTyrMetPheCysAla 
CTGCAGCAGTCACGGAAGGTGGGAGACTCCCCAAATATCACGGAGTACATGTTCTGTGCC 
1150 1160 1170 1180 1190 1200 

+340 +350 
GlyTyr Ser AspGlySerLysAspSerCysLysGlyAspSerGlyGlyProHisAlaThr 
GGCTACTCGGATGGCAGCAAGGACTCCTGCAAGGGGGACAGTGGAGGCCCACATGCCACC 
1210 1220 1230 1240 1250 1260 

+360 +370 
HisTyr ArgGlyThrTrpTyrLeuThrGlylleValSerTrpGlyGlnGlyCysAlaThr 
CACTACCGGGGCACGTGGTACCTGACGGGCATCGTCAGCTGGGGCCAGGGCTGCGCAACC 
1270 1280 1290 1300 1310 1320 
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+380 +390 
ValGlyHisPheGlyValTyrThrArgValSerGlnTyrlleGluTrpLeuGlnLysLeu 
GTGGGCCACTTTGGGGTGTACACCAGGGTCTCCCAGTACATCGAGTGGCTGCAAAAGCTC 
1330 1340 1350 1360 1370 1380 

+400 +406 

MetArgSerGluProArgProGlyValLeuLeuArgAlaProPhePro*** 

ATGCGCTCAGAGCCACGCCCAGGAGTCCTCCTGCGAGCCCCATTTCCCTAGCCCAGCAGC 
1390 1400 1410 1420 1430 1440 

CCTGGCCTGTGGAGAGAAAGCCAAGGCTGCGTCGAACTGTCCTGGCACCAAATCCCATAT 
1450 1460 1470 1480 1490 1500 

ATTCTTCTGCAGTTAATGGGGTAGAGGAGGGCATGGGAGGGAGGGAGAGGTGGGGAGGGA 
1510 1520 1530 1540 1550 1560 

GACAGAGACAGAAACAGAGAGAGACAGAGACAGAGAGAGACTGAGGGAGAGACTCTGAGG 
1570 1580 1590 1600 1610 1620 

ACATGGAGAGAGACTCAAAGAGACTCCAAGATTCAAAGAGACTAATAGAGACACAGAGAT 
1630 1640 1650 1660 1670 1680 

GGAATAGAAAAGATGAGAGGCAGAGGCAGACAGGCGCTGGACAGAGGGGCAGGGGAGTGC 
1690 1700 1710 1720 1730 1740 

CAAGGTTGTCCTGGAGGCAGACAGCCCAGCTGAGCCTCCTTACCTCCCTTCAGCCAAGCC 
1750 1760 1770 1780 1790 1800 

CCACCTGCACGTGATCTGCTGGCCCTCAGGCTGCTGCTCTGCCTTCATTGCTGGAGACAG 
1810 1820 1830 1840 1850 1860 

TAGAGGCATGAACACACATGGATGCACACACACACACGCCAATGCACACACACAGAGATA 
1870 1880 1890 1900 1910 1920 

TGCACACACACGGATGCACACACAGATGGTCACACAGAGATACGCAAACACACCGATGCA 
1930 1940 1950 1960 1970 1980 

CACGCACATAGAGATATGCACACACAGATGCACACACAGATATACACATGGATGCACGCA 
1990 2000 2010 2020 2030 2040 

CATGCCAATGCACGCACACATCAGTGCACACGGATGCACAGAGATATGCACACACCGATG 
2050 2060 2070 2080 2090 2100 

TGCGCACACACAGATATGCACACACATGGATGAGCACACACACACCAAGTGCGCACACAC 
2110 2120 2130 2140 2150 2160 

ACCGATGTACACACACAGATGCACACACAGATGCACACACACCGATGCTGACTCCATGTG 
2170 2180 2190 2200 2210 2220 

TGCTGTCCTCTGAAGG03GTTGTTTAGCTCTCACTTTTCTGGTTCTTATCCATTATCATC 
2230 2240 2250 2260 2270 2280 

TTCACTTCAGACAATTCAGAAGCATCACCATGCATGGTGGCGAATGCCCCCAAACTCTCC 
2290 2300 2310 2320 2330 2340 

CCCAAATGTATTTCTCCCTTCGCTGGGTGCCGGGCTGCACAGACTATTCCCCACCTGCTT 
2350 2360 2370 2380 2390 2400 



40 



EP 0 200 421 B1 



CCCAGCTTCACAATAAACGGCTGCGTCTCCTCCGCACACCTGTGGTGCCTGCCACCCAAA 
2410 2420 2430 2240 2450 2460 

AAAAAAAAAAAAAAAAAA 
2470 2480 
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FIG 7 

GGATCC ATG CAG CGC GrG AAC ATG ATC ATG GCA GAA TCA CCA GGC 
MET Gin Arg Val Asn MET lie MET Ala Glu Ser Pro Gly 

66 81 

CTC ATC ACC ATC TGC CTT TTA GGA TAT CTA CTC AGT GCT GAA TGT 
Leu lie Thr He Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys 

96 111 126 

ACA GTT TTT CTT GAT CAT GAA AAC GCC AAC AAA ATT CTG AAT CGG 
Thr Val Phe Leu Asp His Glu Asn Ala Asn Lys He Leu Asn Arg 

141 156 171 

CCA AAG AGG TAT AAT TCA GGT AAA TTG GAA GAG TTT GTT CAA GGG 
Pro Lys Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gin Gly 

186 201 216 

AAC CTT GAG AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA 
Asn Leu Glu Arg Glu Cys MET Glu Glu Lys Cys Ser Phe Glu Glu 

231 246 261 

GCA CGA GAA GTT TTT GAA AAC ACT GAA AGA ACA AAG CTG TTC TGG 
Ala Arg Glu Val Phe Glu Asn Thr Glu Arg Thr Lys Leu Phe Trp 

276 291 306 

ATT TCT TAC AGT GAT GGG GAC CAG TGT GCC TCA AGT CCA TGC CAG 
He Ser Tyr Ser Asp Gly Asp Gin Cys Ala Ser Ser Pro Cys Gin 

321 336 351 

AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC TAT ATC TGC TTC 
Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser Tyr He Cys Phe 

366 381 396 

TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG GAT 
Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys Asp 

411 426 441 

GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TAC 
Asp Glu Leu He Cys Val Asn Glu Asn Gly Gly Cys Glu Gin Tyr 

456 471 486 

TGC AGT GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG 
Cys Ser Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu 

501 516 531 

GGG TAC TCT CTG CTG GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 
Gly Tyr Ser Leu Leu Ala Asp Gly Val Ser Cys Thr Pro Thr Val 

546 561 576 

GAA TAT CCA TCT GGA AAA ATA CCT ATT CTA GAA AAA AGA AAT GCC 
Glu Tyr Pro Cys Gly Lys He Pro He Leu Glu Lys Arg Asn Ala 

591 606 621 

AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG GTG TGC CCC AAA 
Ser Lys Pro Gin Gly Arg lie Val Gly Gly Lys Val Cys Pro Lys 
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AGG 
Arg 


ACG 
Thr 


966 
CTG 
Leu 


GCC 
Ala 


TTC 
Phe 


GTG 
Val 


CGC 
Arg 


981 
TTC 
Phe 


TCA 
Ser 


TTG 
Leu 


GTC 
Val 


AGC 
Ser 


996 
GGC 
Gly 


TGG 
Trp 


GGC 
Gly 


CAG 
Gin 


1011 
CTG CTG 
Leu Leu 


GAC 
Asp 


CGT 
Arg 


GGC 
Gly 


1026 
GCC ACG 
Ala Thr 


GCC 
Ala 


CTG 
Leu 


GAG 
Glu 


1041 
CTC ATG 
Leu MET 


GTC 
Val 


CTC 
Leu 


AAC 
Asn 


1056 
GTG CCC 
Val Pro 


CGG 
Arg 


CTG 
Leu 


ATG 
MET 


1071 
ACC CAG 
Thr Gin 


GAC 
Asp 


TGC 
Cys 


CTG 
Leu 


1086 
CAG CAG 
Gin Gin 


TCA 
Ser 


CGG 
Arg 


AAG 
Lys 


1101 
GTG GGA 
Val Gly 


GAC 
Asp 


TCC 
Ser 


CCA 
Pro 


1116 
AAT ATC 
Asn lie 


ACG 
Thr 


GAG 
Glu 


TAC 
Tyr 


1131 
ATG TTC 
MET Phe 


TGT 
Cys 


GCC 
Ala 


GGC 
Gly 


1146 

TAC TCG 
Tyr Ser 


GAT 
Asp 


GGC 
Gly 


AGC 
Ser 


1161 
AAG GAC 
Lys Asp 


TCC 
Ser 


TGC 
Cys 


AAG 
Lys 


1176 
GGG GAC 
Gly Asp 


AGT 
Ser 


GGA 
Gly 


GGC 
Gly 


1191 
CCA CAT 
Pro His 


GCC 
Ala 


ACC 
Thr 


CAC 
His 


1206 
TAC CGG 
Tyr Arg 


GGC 
Gly 


ACG TGG 
Thr Trp 


TAC 
Tyr 


1221 
CTG ACG 
Leu Thr 


GGC 
gly 


ATC 
He 


1236 
GTC AGC 
Val Ser 


TGG 
Trp 


GGC 
Gly 


CAG 
Gin 


1251 
GGC TGC 
Gly Cys 


GCA 
Ala 


ACC 
Thr 


GTG 
Val 
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1266 1281 1296 

GGC CAC TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG 
Gly his Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp 

1311 1326 1341 

CTG CAA AAG CTC ATG CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG 
Leu Gin Lys Leu MET Arg Ser Glu Pro Arg Pro Gly Val Leu Leu 

1356 1378 1388 1398 

CGA GCC CCA TTT CCC TAG CCCAGCAGCC CTGGCCTGTG GAGAGAAAGC 
Arg Ala Pro Phe Pro 

1408 1418 1428 1438 1448 

CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 

1458 1468 1478 1488 1498 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG 

1508 1518 1528 1538 1548 

ACAGAGACAG AAACAGAGAG AGACAGAGAC AGAGAGAGAC TGAGGGAGAG 

1558 1568 1578 1588 1598 

ACTCTGAGGA CCATGGAGAG AGACTCAAAG AGACTCCAAG ATTCAAAGAG 

1608 1618 1628 1638 1648 

ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGaGAGG CAGAGGCAGA 

1658 1668 1678 1688 1698 

CAGGCGCTGG ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG 

1708 1718 1728 1738 1748 

ACAGCCCAGC TGAGCCTCCT TACCTCCCTT CAGCCAAGCC CCACCTGCAC 

1758 1768 1778 1788 1798 

GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 

1808 1818 1828 1838 1848 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA 

1858 1868 1878 1888 1898 

CAGAGATATG CACACACACG GATGCACACA CAGATGGTCA CACAGAGTAC 



1908 1918 1928 1938 1948 

GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC ACAGATGCAC 
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1958 1968 1978 1988 1998 

ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC 



2006 2018 2028 2038 2048 

AGTGCACACG GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA 



2058 2068 2078 2088 2098 

GATATGCACA CACATGGATG AGCACACACA CACCAAGTGC GCACACACAC 



2108 2118 2128 2138 2148 

CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 



2158 2168 2178 2188 2198 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT 



2208 2218 2228 2238 2248 

CTTATCCATT ATCATCTTCA CTTCAGACAA TTCAGAAGCA TCACCATGCA 



2258 2268 2276 2288 2298 

TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC TCCCTTCGCT 



2308 2316 2328 2338 2348 

GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT 



2358 2368 2378 2368 2398 

AAACGGCTGC GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



2408 2418 2428 2438 

AAAAAAAAAA AAGGAATTCG AGCTCGGTAC CCGGGGATCC 
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EcoRI BglE EcoRI 

X LOT r 
A fee x 



EcoRI BglE EcoRI 
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FIG. 11 



EcoRI 



BQlH 



FSK 5' FROM 0501 565 



BgtE 



EcoRI HindU 




F3HI 3* FROM pEU 2463 
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EcoRI 
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EcoRI 
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FIG. 12 
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